首页> 外文会议>Annual International Conference of the IEEE Engineering in Medicine and Biology Society >A simulated environment for early development stages of reinforcement learning algorithms for closed-loop deep brain stimulation
【24h】

A simulated environment for early development stages of reinforcement learning algorithms for closed-loop deep brain stimulation

机译:用于闭环深部脑刺激的强化学习算法早期开发阶段的模拟环境

获取原文

摘要

In recent years, closed-loop adaptive deep brain stimulation (aDBS) for Parkinson’s disease (PD) has gained focus in the research community, due to promising proof-of-concept studies showing its suitability for improving DBS therapy and ameliorating related side effects.The main challenges faced in the aDBS control problem is the presence of non-stationaryon-linear dynamics and the heterogeneity of PD’s phenotype, making the exploration of data-driven dynamics-aware control algorithms a promising research direction. However, due to the severe safety constraints related to working with patients, aDBS is a sensitive research field that requires surrogate development platforms with growing complexity, as novel control algorithms are validated.With our current contribution, we propose the characterization and categorization of non-stationary dynamics found in the aDBS problem. We show how knowledge about these dynamics can be embedded in a surrogate simulation environment, which has been designed to support early development stages of aDBS control strategies, specifically those based on reinforcement learning (RL) algorithms. Finally, we present a comparison of representative RL methods designed to cope with the type of non-stationary dynamics found in aDBS.To allow reproducibility and encourage adoption of our approach, the source code of the developed methods and simulation environment are made available online.
机译:近年来,针对帕金森氏病(PD)的闭环自适应深部脑刺激(aDBS)已成为研究界的关注点,这是由于有前途的概念验证研究表明其适用于改善DBS治疗和缓解相关副作用。 aDBS控制问题面临的主要挑战是非平稳/非线性动力学的存在以及PD表型的异质性,这使得探索数据驱动的动态感知控制算法成为一个有前途的研究方向。但是,由于与患者工作相关的严格安全限制,随着验证新的控制算法,aDBS是一个敏感的研究领域,需要替代的开发平台变得越来越复杂。在aDBS问题中发现的平稳动力学。我们展示了如何将有关这些动力学的知识嵌入到替代模拟环境中,该模拟环境旨在支持aDBS控制策略的早期开发阶段,尤其是基于强化学习(RL)算法的控制策略。最后,我们对代表性的RL方法进行了比较,这些方法旨在应对aDBS中发现的非平稳动力学类型。为了允许重现性并鼓励采用我们的方法,在线提供了已开发方法和仿真环境的源代码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号