首页> 外文会议>International Joint Conference on Neural Networks >Reward-based online learning in non-stationary environments: Adapting a P300-speller with a “backspace” key
【24h】

Reward-based online learning in non-stationary environments: Adapting a P300-speller with a “backspace” key

机译:在非固定环境中基于奖励的在线学习:使用“退格”键修改P300-Speller

获取原文

摘要

We adapt a policy gradient approach to the problem of reward-based online learning of a non-invasive EEG-based “P300”-speller. We first clarify the nature of the P300-speller classification problem and present a general regularized gradient ascent formula. We then show that when the reward is immediate and binary (namely “bad response” or “good response”), each update is expected to improve the classifier accuracy, whether the actual response is correct or not. We also estimate the robustness of the method to occasional mistaken rewards, i.e. show that the learning efficacy may only linearly decrease with the rate of invalid rewards. The effectiveness of our approach is tested in a series of simulations reproducing the conditions of real experiments. We show in a first experiment that a systematic improvement of the spelling rate is obtained for all subjects in the absence of initial calibration. In a second experiment, we consider the case of the online recovery that is expected to follow failed electrodes. Combined with a specific failure detection algorithm, the spelling error information (typically contained in a “backspace” hit) is shown useful for the policy gradient to adapt the P300 classifier to the new situation, provided the feedback is reliable enough (namely having a reliability greater than 70%).
机译:我们采用一种策略梯度方法来解决基于无创EEG的“ P300” -speller的基于奖励的在线学习问题。我们首先弄清P300-Speller分类问题的性质,并提出一个通用的正则化梯度上升公式。然后,我们表明,当奖励是立即和二进制的(即“不良响应”或“良好响应”)时,无论实际响应是否正确,每次更新都有望提高分类器的准确性。我们还估计了该方法对偶尔的错误奖励的鲁棒性,即表明学习功效可能仅随着无效奖励的比率线性降低。我们的方法的有效性在再现真实实验条件的一系列模拟中进行了测试。我们在第一个实验中表明,在没有初始校准的情况下,所有受试者的拼写率都得到了系统的提高。在第二个实验中,我们考虑了预计将在故障电极之后进行在线恢复的情况。结合特定的故障检测算法,显示拼写错误信息(通常包含在“退格”命中中)对于策略梯度使P300分类器适应新情况非常有用,前提是反馈足够可靠(即具有可靠性)。大于70%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号