首页> 美国卫生研究院文献>Scientific Reports >Exploration and recency as the main proximate causes of probability matching: a reinforcement learning analysis
【2h】

Exploration and recency as the main proximate causes of probability matching: a reinforcement learning analysis

机译:探索和新近度是概率匹配的主要原因:加强学习分析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Research has not yet reached a consensus on why humans match probabilities instead of maximise in a probability learning task. The most influential explanation is that they search for patterns in the random sequence of outcomes. Other explanations, such as expectation matching, are plausible, but do not consider how reinforcement learning shapes people’s choices. We aimed to quantify how human performance in a probability learning task is affected by pattern search and reinforcement learning. We collected behavioural data from 84 young adult participants who performed a probability learning task wherein the majority outcome was rewarded with 0.7 probability, and analysed the data using a reinforcement learning model that searches for patterns. Model simulations indicated that pattern search, exploration, recency (discounting early experiences), and forgetting may impair performance. Our analysis estimated that 85% (95% HDI [76, 94]) of participants searched for patterns and believed that each trial outcome depended on one or two previous ones. The estimated impact of pattern search on performance was, however, only 6%, while those of exploration and recency were 19% and 13% respectively. This suggests that probability matching is caused by uncertainty about how outcomes are generated, which leads to pattern search, exploration, and recency.
机译:关于人类为何匹配概率而不是在概率学习任务中最大化的研究尚未达成共识。最具影响力的解释是,他们以结果的随机顺序搜索模式。诸如期望匹配之类的其他解释是合理的,但没有考虑强化学习如何影响人们的选择。我们旨在量化概率学习任务中的人类表现如何受到模式搜索和强化学习的影响。我们收集了84位年轻人的行为数据,这些学生执行了概率学习任务,其中大多数结果以0.7的概率得到奖励,并使用搜索模式的强化学习模型对数据进行了分析。模型仿真表明,模式搜索,探索,新近度(打折早期经验)和遗忘可能会损害性能。我们的分析估计,有85%(95%HDI [76,94])的参与者搜索了模式,并认为每个试验的结果都取决于之前的一个或两个。模式搜索对性能的估计影响仅为6%,而探索和新近度分别为19%和13%。这表明,概率匹配是由结果生成方式的不确定性引起的,从而导致模式搜索,探索和新近度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号