首页> 外文会议>Annual International Conference of the IEEE Engineering in Medicine and Biology Society >Maximum correntropy based attention-gated reinforcement learning designed for brain machine interface
【24h】

Maximum correntropy based attention-gated reinforcement learning designed for brain machine interface

机译:专为脑机接口设计的基于最大熵的注意门控强化学习

获取原文

摘要

Reinforcement learning is an effective algorithm for brain machine interfaces (BMIs) which interprets the mapping between neural activities with plasticity and the kinematics. Exploring large state-action space is difficulty when the complicated BMIs needs to assign credits over both time and space. For BMIs attention gated reinforcement learning (AGREL) has been developed to classify multi-actions for spatial credit assignment task with better efficiency. However, the outliers existing in the neural signals still make interpret the neural-action mapping difficult. We propose an enhanced AGREL algorithm using correntropy as a criterion, which is more insensitive to noise. Then the algorithm is tested on the neural data where the monkey is trained to do the obstacle avoidance task. The new method converges faster during the training period, and improves from 44.63% to 68.79% on average in success rate compared with the original AGREL. The result indicates that the combination of correntropy criterion and AGREL can reduce the effect of the outliers with better performance when interpreting the mapping between neural signal and kinematics.
机译:强化学习是一种针对脑机接口(BMI)的有效算法,该算法可解释具有可塑性的神经活动与运动学之间的映射。当复杂的BMI需要在时间和空间上分配信用时,探索大型的国家行动空间是困难的。对于BMI,注意力控制强化学习(AGREL)已被开发出来,可以更好地对空间信用分配任务的多动作进行分类。然而,神经信号中存在的离群值仍然使解释神经作用图变得困难。我们提出了一种以熵为准则的增强型AGREL算法,该算法对噪声更不敏感。然后,在训练猴子完成避障任务的神经数据上对该算法进行测试。新方法在训练期间收敛更快,与原始AGREL相比,成功率从44.63%提高到68.79%。结果表明,在解释神经信号与运动学之间的映射时,结合熵准则和AGREL可以更好地降低离群值的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号