...
首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Quantized Attention-Gated Kernel Reinforcement Learning for Brain–Machine Interface Decoding
【24h】

Quantized Attention-Gated Kernel Reinforcement Learning for Brain–Machine Interface Decoding

机译:脑机接口解码的量化注意门控核增强学习

获取原文
获取原文并翻译 | 示例
           

摘要

Reinforcement learning (RL)-based decoders in brain-machine interfaces (BMIs) interpret dynamic neural activity without patients' real limb movements. In conventional RL, the goal state is selected by the user or defined by the physics of the problem, and the decoder finds an optimal policy essentially by assigning credit over time, which is normally very time-consuming. However, BMI tasks require finding a good policy in very few trials, which impose a limit on the complexity of the tasks that can be learned before the animal quits. Therefore, this paper explores the possibility of letting the agent infer potential goals through actions over space with multiple objects, using the instantaneous reward to assign credit spatially. A previous method, attention-gated RL employs a multilayer perceptron trained with backpropagation, but it is prone to local minima entrapment. We propose a quantized attention-gated kernel RL (QAGKRL) to avoid the local minima adaptation in spatial credit assignment and sparsify the network topology. The experimental results show that the QAGKRL achieves higher successful rates and more stable performance, indicating its powerful decoding ability for more sophisticated BMI tasks as required in clinical applications.
机译:人机界面(BMI)中基于增强学习(RL)的解码器可解释动态神经活动,而无需患者进行实际肢体运动。在传统的RL中,目标状态由用户选择或由问题的物理性质定义,并且解码器基本上通过随时间分配信用来找到最佳策略,这通常非常耗时。但是,BMI任务需要在极少的试验中找到好的策略,这限制了在动物退出之前可以学习的任务的复杂性。因此,本文探讨了使用即时奖励在空间上分配信用,让代理通过对具有多个对象的空间行为进行推断潜在目标的可能性。注意门控RL是一种先前的方法,它采用经过反向传播训练的多层感知器,但是它倾向于局部极小陷波。我们提出了一种量化的注意门控内核RL(QAGKRL),以避免空间信用分配中的局部最小值适应并稀疏网络拓扑。实验结果表明,QAGKRL具有较高的成功率和更稳定的性能,表明其强大的解码能力可满足临床应用所需的更复杂的BMI任务。

著录项

  • 来源
  • 作者单位

    Department of Biomedical Engineering, Qiushi Academy for Advanced Studies, Zhejiang University, Hangzhou, China;

    Key Laboratory of Biomedical Engineering, Qiushi Academy for Advanced Studies, Ministry of Education, Zhejiang University, Hangzhou, China;

    Department of Biomedical Engineering, Qiushi Academy for Advanced Studies, Zhejiang University, Hangzhou, China;

    Department of Biomedical Engineering, Qiushi Academy for Advanced Studies, Zhejiang University, Hangzhou, China;

    Department of Biomedical Engineering, Qiushi Academy for Advanced Studies, Zhejiang University, Hangzhou, China;

    Department of Biomedical Engineering, Qiushi Academy for Advanced Studies, Zhejiang University, Hangzhou, China;

    Department of Biomedical Engineering, Ministry of Education, Key Laboratory of Biomedical Engineering, Innovation Joint Research Center for Cyber-Physical-Society System, Qiushi Academy for Advanced Studies, Zhejiang University, Hangzhou, China;

    Department of Biomedical Engineering, Ministry of Education, Key Laboratory of Biomedical Engineering, Innovation Joint Research Center for Cyber-Physical-Society System, Qiushi Academy for Advanced Studies, Zhejiang University, Hangzhou, China;

    Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL, USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Kernel; Decoding; Trajectory; Animals; Biomedical engineering; Learning (artificial intelligence); Quantization (signal);

    机译:内核;解码;轨迹;动物;生物医学工程;学习(人工智能);量化(信号);

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号