...
首页> 外文期刊>Journal of Theoretical Biology >The application of temporal difference learning in optimal diet models
【24h】

The application of temporal difference learning in optimal diet models

机译:时间差异学习在最佳饮食模型中的应用

获取原文
获取原文并翻译 | 示例

摘要

An experience-based aversive learning model of foraging behaviour in uncertain environments is presented. We use Q-learning as a model-free implementation of Temporal difference learning motivated by growing evidence for neural correlates in natural reinforcement settings. The predator has the choice of including an aposematic prey in its diet or to forage on alternative food sources. We show how the predator's foraging behaviour and energy intake depend on toxicity of the defended prey and the presence of Batesian mimics. We introduce the precondition of exploration of the action space for successful aversion formation and show how it predicts foraging behaviour in the presence of conflicting rewards which is conditionally suboptimal in a fixed environment but allows better adaptation in changing environments.
机译:提出了一种在不确定环境下觅食行为的基于经验的厌恶学习模型。我们使用Q学习作为时间差异学习的无模型实现,该学习是由自然强化环境中神经相关的证据不断增长所激发的。捕食者可以选择在其饮食中包括无定形的猎物,或在其他食物来源中觅食。我们展示了捕食者的觅食行为和能量摄入如何取决于被保护猎物的毒性和贝茨模拟物的存在。我们介绍了探索行为空间以成功形成厌恶行为的前提,并展示了它如何在存在冲突奖励的情况下预测觅食行为,该奖励条件在固定环境中条件不理想,但可以在变化的环境中更好地适应。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号