...
首页> 外文期刊>PLoS Computational Biology >Learning from Sensory and Reward Prediction Errors during Motor Adaptation
【24h】

Learning from Sensory and Reward Prediction Errors during Motor Adaptation

机译:从电机适应过程中的感官和奖励预测错误中学习

获取原文
           

摘要

Voluntary motor commands produce two kinds of consequences. Initially, a sensory consequence is observed in terms of activity in our primary sensory organs (e.g., vision, proprioception). Subsequently, the brain evaluates the sensory feedback and produces a subjective measure of utility or usefulness of the motor commands (e.g., reward). As a result, comparisons between predicted and observed consequences of motor commands produce two forms of prediction error. How do these errors contribute to changes in motor commands? Here, we considered a reach adaptation protocol and found that when high quality sensory feedback was available, adaptation of motor commands was driven almost exclusively by sensory prediction errors. This form of learning had a distinct signature: as motor commands adapted, the subjects altered their predictions regarding sensory consequences of motor commands, and generalized this learning broadly to neighboring motor commands. In contrast, as the quality of the sensory feedback degraded, adaptation of motor commands became more dependent on reward prediction errors. Reward prediction errors produced comparable changes in the motor commands, but produced no change in the predicted sensory consequences of motor commands, and generalized only locally. Because we found that there was a within subject correlation between generalization patterns and sensory remapping, it is plausible that during adaptation an individual's relative reliance on sensory vs. reward prediction errors could be inferred. We suggest that while motor commands change because of sensory and reward prediction errors, only sensory prediction errors produce a change in the neural system that predicts sensory consequences of motor commands.
机译:自愿运动命令会产生两种后果。最初,在我们的主要感觉器官的活动(例如,视觉,本体感受)方面观察到了感觉结果。随后,大脑评估感觉反馈并产生对运动命令的效用或有用性的主观测量(例如,奖励)。结果,电动机命令的预测结果与观察到的结果之间的比较产生了两种形式的预测误差。这些错误如何导致电机命令发生变化?在这里,我们考虑了范围适应协议,发现当高质量的感觉反馈可用时,运动命令的适应几乎完全由感觉预测误差来驱动。这种学习形式具有鲜明的特征:随着运动命令的适应,受试者改变了对运动命令的感觉后果的预测,并将这种学习广泛地推广到相邻的运动命令。相反,随着感觉反馈质量的下降,运动命令的适应性变得越来越依赖于奖励预测误差。奖励预测误差在运动指令中产生了可比的变化,但在运动指令的预测感官结果中未产生变化,并且仅局部推广。因为我们发现泛化模式和感官重映射之间存在主题相关性,所以在适应过程中可以推断出个人对感官对奖励预测错误的相对依赖。我们建议,虽然运动命令由于感觉和奖励预测错误而改变,但只有感觉预测错误会在神经系统中产生变化,从而预测运动命令的感觉后果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号