...
首页> 外文期刊>Procedia Computer Science >An Autonomous Distal Reward Learning Architecture for Embodied Agents
【24h】

An Autonomous Distal Reward Learning Architecture for Embodied Agents

机译:面向实体代理的自主远程奖励学习架构

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Distal reward refers to a class of problems where reward is temporally distal from actions that lead to reward. The difficulty for any biological neural system is that the neural activations that caused an agent to achieve reward may no longer be present when the reward is experienced. Therefore in addition to the usual reward assignment problem, there is the additional complexity of rewarding through time based on neural activations that may no longer be present. Although this problem has been thoroughly studied over the years using methods such as reinforcement learning, we are interested in a more biologically motivated neural architectural approach. This paper introduces one such architecture that exhibits rudimentary distal reward learning based on associations of bottom-up visual sensory sequences with bottom-up proprioceptive motor sequences while an agent explores an environment. After sufficient learning, the agent is able to locate the reward through chaining together of top-down motor command sequences. This paper will briefly discuss the details of the neural architecture, the agent-based modeling system in which it is embodied, a virtual Morris water maze environment used for training and evaluation, and a sampling of numerical experiments characterizing its learning properties.
机译:远期奖励是指一类问题,其中奖励在时间上远超导致奖励的行动。任何生物神经系统的困难在于,当获得奖励时,导致代理获得奖励的神经激活可能不再存在。因此,除了通常的奖励分配问题外,还有基于神经激活的时间奖励的额外复杂性,这种复杂性可能不再存在。尽管多年来已经使用诸如强化学习之类的方法对这一问题进行了深入研究,但我们对更具生物学动机的神经体系结构方法感兴趣。本文介绍了一种这样的体系结构,该体系结构在探员探索环境时,基于自下而上的视觉感觉序列与自下而上的本体感受运动序列之间的关联,展示了基本的远端奖励学习。经过充分学习后,代理可以通过将自上而下的运动命令序列链接在一​​起来找到奖励。本文将简要讨论神经体系结构的细节,包含该体系结构的基于代理的建模系统,用于训练和评估的虚拟莫里斯水迷宫环境以及表征其学习特性的数值实验样本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号