Reinforcement learning for discounted values often loses the goal in the application to animal learning

Yoshiya Yamaguchi; Yutaka Sakai

首页> 外文期刊>Neural Networks: The Official Journal of the International Neural Network Society >Reinforcement learning for discounted values often loses the goal in the application to animal learning

【24h】

Reinforcement learning for discounted values often loses the goal in the application to animal learning

机译：折扣价值的强化学习常常失去了在动物学习中的应用目标

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The impulsive preference of an animal for an immediate reward implies that it might subjectively discount the value of potential future outcomes. A theoretical framework to maximize the discounted subjective value has been established in the reinforcement learning theory. The framework has been successfully applied in engineering. However, this study identified a limitation when applied to animal behavior, where in some cases, there is no learning goal. Here a possible learning framework was proposed that is well-posed in any cases and that is consistent with the impulsive preference.

机译：动物对即时奖励的冲动偏好意味着它可能在主观上低估了潜在未来结果的价值。在强化学习理论中已经建立了最大化折现主观价值的理论框架。该框架已成功应用于工程中。但是，这项研究发现了在应用于动物行为时的局限性，在某些情况下，这没有学习目标。在这里，提出了一个可能的学习框架，该框架在任何情况下都具有良好的条件，并且与冲动性偏好保持一致。

著录项

来源
《Neural Networks: The Official Journal of the International Neural Network Society》 |2012年第11期|共4页
作者
Yoshiya Yamaguchi; Yutaka Sakai;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类神经病学;
关键词
Inter-temporal choice; Delay discounting; Impulsivity; Reinforcement learning;

机译：跨时选择时延折扣冲动强化学习;
入库时间 2022-08-18 12:21:30

相似文献

外文文献
中文文献
专利

1. Reinforcement learning for discounted values often loses the goal in the application to animal learning [J] . Yoshiya Yamaguchi, Yutaka Sakai Neural Networks: The Official Journal of the International Neural Network Society . 2012,第Nova期

机译：折扣价值的强化学习常常失去了在动物学习中的应用目标
2. Human reinforcement learning subdivides structured action spaces by learning effector-specific values. [J] . Gershman SJ, Pesaran B, Daw ND The Journal of Neuroscience: The Official Journal of the Society for Neuroscience . 2009,第43期

机译：人类强化学习通过学习特定于效应器的值来细分结构化的动作空间。
3. Learning the Gain Values and Discount Factors of Discounted Cumulative Gains [J] . Zhou Ke, Zha Hongyuan, Chang Yi, IEEE Transactions on Knowledge and Data Engineering . 2014,第2期

机译：学习折现累积收益的收益值和折现因子
4. To Discount or not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning [C] . Sridhar Mahadevan Machine learning . 1994

机译：强化学习中要折扣还是不折扣：R学习和Q学习比较的案例研究
5. Understanding Model-Based Reinforcement Learning and its Application in Safe Reinforcement Learning [D] . Hu, Dingcheng . 2019

机译：了解基于模型的强化学习及其在安全强化学习中的应用
6. Human Reinforcement Learning Subdivides Structured Action Spaces by Learning Effector-Specific Values [O] . Samuel J. Gershman, Bijan Pesaran, Nathaniel D. Daw 2009

机译：人类强化学习通过学习效应子特定值来细分结构化的动作空间
7. Reinforcement learning for discounted values often loses the goal in the application to animal learning [O] . Yamaguchi Yoshiya, Sakai Yutaka 2012

机译：折扣价值的强化学习常常失去了在动物学习中的应用目标

Reinforcement learning for discounted values often loses the goal in the application to animal learning

摘要

著录项

相似文献

相关主题

期刊订阅