Journal of Experimental Psychology. General

BRIEF REPORT Humans Use Directed and Random Exploration to Solve the Explore-Exploit Dilemma

Abstract

All adaptive organisms face the fundamental tradeoff between pursuing a known reward (exploitation) and sampling lesser-known options in search of something better (exploration). Theory suggests at least two strategies for solving this dilemma: a directed strategy in which choices are explicitly biased toward information seeking, and a random strategy in which decision noise leads to exploration by chance. In this work we investigated the extent to which humans use these two strategies. In our "Horizon task," participants made explore-exploit decisions in two contexts that differed in the number of choices that they would make in the future (the time horizon). Participants were allowed to make either a single choice in each game (horizon 1), or 6 sequential choices (horizon 6), giving them more opportunity to explore. By modeling the behavior in these two conditions, we were able to measure exploration-related changes in decision making and quantify the contributions of the two strategies to behavior. We found that participants were more information seeking and had higher decision noise with the longer horizon, suggesting that humans use both strategies to solve the exploration-exploitation dilemma. We thus conclude that both information seeking and choice variability can be controlled and put to use in the service of exploration.
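
The difference between the two strategies can be illustrated with a minimal sketch. It assumes a logistic choice rule, which the abstract does not specify: directed exploration appears as an additive information bonus that shifts the choice curve, and random exploration appears as decision noise that flattens it. The function name `p_choose_informative` and the parameters `info_bonus` and `noise` are hypothetical, chosen for illustration only, and should not be read as the authors' model.

```python
import math


def p_choose_informative(mean_diff, info_bonus, noise):
    """Probability of choosing the more informative (less sampled) option.

    mean_diff  -- observed mean reward of the informative option minus
                  that of the other option
    info_bonus -- hypothetical additive bonus for information
                  (stands in for directed exploration)
    noise      -- hypothetical scale of decision noise, must be > 0
                  (stands in for random exploration)
    """
    if noise <= 0:
        raise ValueError("noise must be positive")
    # Logistic choice rule: an assumption of this sketch, not a claim
    # about the model used in the paper.
    return 1.0 / (1.0 + math.exp(-(mean_diff + info_bonus) / noise))


# With equal observed means and no bonus, choice is at chance.
p_neutral = p_choose_informative(0.0, 0.0, 5.0)

# A positive information bonus biases choice toward the informative option.
p_directed = p_choose_informative(0.0, 5.0, 5.0)

# Higher noise makes the worse-looking option more likely to be chosen.
p_low_noise = p_choose_informative(-10.0, 0.0, 5.0)
p_high_noise = p_choose_informative(-10.0, 0.0, 10.0)
```

Under these assumptions, a longer horizon would correspond to a larger `info_bonus` and a larger `noise`, which is one way to express the pattern the abstract reports.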
