...
首页> 外文期刊>PLoS Computational Biology >How pupil responses track value-based decision-making during and after reinforcement learning
【24h】

How pupil responses track value-based decision-making during and after reinforcement learning

机译:在强化学习期间和之后,学生的反应如何跟踪基于价值的决策

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Author summary It has long been known that the pupil dilates when we decide. These pupil dilations have predominantly been linked to arousal. However, reward-related processes may trigger pupil dilations as well, as dilations have been linked to activity in the dopaminergic midbrain, a region important for reward processing and reinforcement learning. Using a learning task and a computational model to quantitatively describe the cognitive processes that drive reinforcement learning behavior, we show that the pupil closely tracks different aspects of the reinforcement learning process. Prior to making a value-based choice, pupil dilation reflected the value of the soon-to-be-chosen option. After receiving choice feedback, early dilation reflected uncertainty about the value of recent choice options, while late constriction reflected how strongly an outcome violated current value beliefs. These findings provide the novel insight that the pupil can be used to track value-based decision-making, opening up a new method for online tracking of reinforcement learning processes.
机译:作者摘要早就知道,当我们做出决定时,瞳孔会扩大。这些瞳孔扩张主要与唤醒有关。但是,与奖励相关的过程也可能触发瞳孔扩张,因为扩张与多巴胺能中脑的活动有关,该区域对奖励过程和强化学习很重要。使用学习任务和计算模型来定量描述驱动强化学习行为的认知过程,我们表明学生密切跟踪强化学习过程的不同方面。在做出基于价值的选择之前,瞳孔扩大反映了即将被选择的选择的价值。收到选择反馈后,早期扩张反映出近期选择权价值的不确定性,而后期收缩反映出结果违反当前价值信念的强烈程度。这些发现提供了新颖的见解,即学生可以用来跟踪基于价值的决策,从而开辟了一种在线跟踪强化学习过程的新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号