Interval timing in deep reinforcement learning agents

Abstract

The measurement of time is central to intelligent behavior. We know that both animals and artificial agents can successfully use temporal dependencies to select actions. In artificial agents, little work has directly addressed (1) which architectural components are necessary for successful development of this ability, (2) how this timing ability comes to be represented in the units and actions of the agent, and (3) whether the resulting behavior of the system converges on solutions similar to those of biology. Here we studied interval timing abilities in deep reinforcement learning agents trained end-to-end on an interval reproduction paradigm inspired by experimental literature on mechanisms of timing. We characterize the strategies developed by recurrent and feedforward agents, which both succeed at temporal reproduction using distinct mechanisms, some of which bear specific and intriguing similarities to biological systems. These findings advance our understanding of how agents come to represent time, and they highlight the value of experimentally inspired approaches to characterizing agent abilities.

Bibliographic record

Source: Conference on Neural Information Processing Systems | 2020 | pp. 6363-7159 | 10 pages
Authors: Ben Deverett; Ryan Faulkner; Meire Fortunato; Greg Wayne; Joel Z. Leibo
Original format: PDF
Chinese Library Classification: Metrology
