首页> 外文期刊>Neural Networks: The Official Journal of the International Neural Network Society >Reinforcement learning for discounted values often loses the goal in the application to animal learning
【24h】

Reinforcement learning for discounted values often loses the goal in the application to animal learning

机译:折扣价值的强化学习常常失去了在动物学习中的应用目标

获取原文
获取原文并翻译 | 示例
       

摘要

The impulsive preference of an animal for an immediate reward implies that it might subjectively discount the value of potential future outcomes. A theoretical framework to maximize the discounted subjective value has been established in the reinforcement learning theory. The framework has been successfully applied in engineering. However, this study identified a limitation when applied to animal behavior, where in some cases, there is no learning goal. Here a possible learning framework was proposed that is well-posed in any cases and that is consistent with the impulsive preference.
机译:动物对即时奖励的冲动偏好意味着它可能在主观上低估了潜在未来结果的价值。在强化学习理论中已经建立了最大化折现主观价值的理论框架。该框架已成功应用于工程中。但是,这项研究发现了在应用于动物行为时的局限性,在某些情况下,这没有学习目标。在这里,提出了一个可能的学习框架,该框架在任何情况下都具有良好的条件,并且与冲动性偏好保持一致。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号