【24h】

A reinforcement learning model for macaque monkey tool-use

机译:猕猴猴子工具使用的加强学习模型

获取原文
获取原文并翻译 | 示例
           

摘要

Iriki et.al. have shown that although Japanese monkeys took about two weeks to be able to use a rake to get distant food, they could immediately use two rakes to get more distant food. They also found neurons in caudal postcentral gyrus, whose visual receptive field is enlarged with using a rake. This study presents a model based on the reinforcement learning theory in order to explain the monkeys' behavior. [MODEL] This model is based on a hypothesis that monkeys can evaluate tools according to their distance to food. The model consists of a conventional Actor-Critic model and a tool-evaluation-component. [RESULTS] Computer simulation reproduced the above experimental data. This verifies the hypothesis that monkeys can evaluate tools.
机译:iriki et.al. 已经表明,虽然日本猴子大约需要两周时间才能使用耙子来获得遥远的食物,但他们可以立即使用两次耙子来获得更多的食物。 他们还在尾部后间隙的神经元发现,使用耙子的视觉接收领域的视觉接收领域被扩大。 本研究提出了一种基于加强学习理论的模型,以解释猴子的行为。 [模型]该模型基于假设,即猴子可以根据其与食物的距离评估工具。 该模型包括传统的演员 - 评论家模型和工具评估组件。 [结果]计算机仿真再现上述实验数据。 这验证了猴子可以评估工具的假设。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号