Theory of Choice in Bandit Information Sampling and Foraging Tasks

机译：强盗信息采样和觅食任务中的选择理论

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Decision making has been studied with a wide array of tasks. Here we examine the theoretical structure of bandit, information sampling and foraging tasks. These tasks move beyond tasks where the choice in the current trial does not affect future expected rewards. We have modeled these tasks using Markov decision processes (MDPs). MDPs provide a general framework for modeling tasks in which decisions affect the information on which future choices will be made. Under the assumption that agents are maximizing expected rewards, MDPs provide normative solutions. We find that all three classes of tasks pose choices among actions which trade-off immediate and future expected rewards. The tasks drive these trade-offs in unique ways, however. For bandit and information sampling tasks, increasing uncertainty or the time horizon shifts value to actions that pay-off in the future. Correspondingly, decreasing uncertainty increases the relative value of actions that pay-off immediately. For foraging tasks the time-horizon plays the dominant role, as choices do not affect future uncertainty in these tasks.

机译：决策制定已经研究了很多任务。在这里，我们研究了土匪，信息采样和觅食任务的理论结构。这些任务超出了当前试验中的选择不会影响未来预期奖励的任务。我们已经使用马尔可夫决策过程（MDP）对这些任务进行了建模。 MDP为建模任务提供了一个通用框架，在该框架中，决策会影响信息，以便将来做出选择。在代理人最大化预期回报的假设下，MDP提供了规范解决方案。我们发现，所有这三类任务都会在权衡即时和未来预期奖励的行动中做出选择。但是，这些任务以独特的方式推动了这些折衷。对于强盗和信息采样任务，不确定性增加或时间跨度将价值转移到将来会获得回报的行动上。相应地，减少不确定性会增加立即获得回报的行动的相对价值。对于觅食任务，时间地平线起着主导作用，因为选择不会影响这些任务的未来不确定性。

著录项

期刊名称 PLoS Computational Biology
作者
Bruno B. Averbeck;
展开▼
作者单位

展开▼
年(卷),期 2015(11),3
年度 2015
页码 e1004164
总页数 28
原文格式 PDF
正文语种
中图分类生化遗传学;生化药理学;
关键词

相似文献

外文文献
中文文献
专利

1. Theory of Choice in Bandit, Information Sampling and Foraging Tasks [J] . Bruno B. Averbeck PLoS Computational Biology . 2015,第3期

机译：强盗，信息采样和觅食任务中的选择理论
2. Theory of Choice in Bandit, Information Sampling and Foraging Tasks [J] . Bruno B. Averbeck PLoS Computational Biology . 2015,第3期

机译：强盗，信息采样和觅食任务中的选择理论
3. Reassessing intertemporal choice: human decision-making is more optimal in a foraging task than in a self-control task [J] . Evan C. Carter, Eric J. Pedersen, Michael E. McCullough Frontiers in Psychology . 2015,第4期

机译：重新评估跨期选择：与自控任务相比，人类的决策在觅食任务中更为理想
4. Foraging theory for decision-making system design: task-type choice [C] . Andrews, B.W., Passino, . 2004

机译：决策系统设计的觅食理论：任务类型选择
5. Risk sensitivity in intertemporal choice: A synthesis of foraging and behavioral economic theory. [D] . Smith, Carter L. 2004

机译：跨期选择中的风险敏感性：觅食与行为经济学理论的综合。
6. Basal Ganglia Preferentially Encode Context Dependent Choice in a Two-Armed Bandit Task [O] . André Garenne, Benjamin Pasquereau, Martin Guthrie, 2011

机译：在两臂强盗任务中基础神经节优先编码上下文相关选择
7. Theory of choice in bandit, information sampling and foraging tasks. [O] . Bruno B Averbeck 2015

机译：强盗，信息抽样和觅食任务的选择理论。

Theory of Choice in Bandit Information Sampling and Foraging Tasks

摘要

著录项

相似文献

相关主题

期刊订阅