Exploration and recency as the main proximate causes of probability matching: a reinforcement learning analysis

Abstract

Research has not yet reached a consensus on why humans match probabilities instead of maximise in a probability learning task. The most influential explanation is that they search for patterns in the random sequence of outcomes. Other explanations, such as expectation matching, are plausible, but do not consider how reinforcement learning shapes people’s choices. We aimed to quantify how human performance in a probability learning task is affected by pattern search and reinforcement learning. We collected behavioural data from 84 young adult participants who performed a probability learning task wherein the majority outcome was rewarded with 0.7 probability, and analysed the data using a reinforcement learning model that searches for patterns. Model simulations indicated that pattern search, exploration, recency (discounting early experiences), and forgetting may impair performance. Our analysis estimated that 85% (95% HDI [76, 94]) of participants searched for patterns and believed that each trial outcome depended on one or two previous ones. The estimated impact of pattern search on performance was, however, only 6%, while those of exploration and recency were 19% and 13% respectively. This suggests that probability matching is caused by uncertainty about how outcomes are generated, which leads to pattern search, exploration, and recency.
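The gap between matching and maximising in the task described above can be worked out directly, and the roles of exploration and recency can be illustrated with a toy learner. The following Python sketch is only an illustration under stated assumptions: it is not the authors' pattern-searching model, and the function names, the softmax choice rule, and the parameters `alpha` (learning rate, standing in for recency) and `beta` (inverse temperature, standing in for exploration) are hypothetical choices made for this example.

```python
import math
import random


def expected_accuracy(p_choose_majority: float, p_majority: float = 0.7) -> float:
    """Probability of a correct prediction when the majority option is chosen
    with probability p_choose_majority, independently of the actual outcome."""
    return (p_choose_majority * p_majority
            + (1.0 - p_choose_majority) * (1.0 - p_majority))


def simulate(alpha: float, beta: float, n_trials: int = 10000,
             p_majority: float = 0.7, seed: int = 0) -> float:
    """Fraction of correct predictions made by a two-option softmax learner.

    alpha: learning rate (assumption: larger values discount early trials more)
    beta:  inverse temperature (assumption: smaller values mean more exploration)
    """
    rng = random.Random(seed)
    q = [0.0, 0.0]  # value estimates; option 0 is the majority option
    correct = 0
    for _ in range(n_trials):
        # A softmax over two options reduces to a logistic function.
        p_choose_0 = 1.0 / (1.0 + math.exp(-beta * (q[0] - q[1])))
        choice = 0 if rng.random() < p_choose_0 else 1
        outcome = 0 if rng.random() < p_majority else 1
        reward = 1.0 if choice == outcome else 0.0
        correct += int(reward)
        # Exponential recency-weighted update of the chosen option only.
        q[choice] += alpha * (reward - q[choice])
    return correct / n_trials


if __name__ == "__main__":
    # Matching: choose the majority option 70% of the time.
    print(round(expected_accuracy(0.7), 2))
    # Maximising: always choose the majority option.
    print(round(expected_accuracy(1.0), 2))
    # Toy learner with moderate exploration and recency.
    print(simulate(alpha=0.3, beta=3.0))
```

With a majority probability of 0.7, a strict matcher is expected to be correct on 0.7 × 0.7 + 0.3 × 0.3 = 0.58 of trials, against 0.7 for a maximiser. Varying `alpha` and `beta` in `simulate` shows how a learner of this kind can fall short of maximising without any pattern search, which is the kind of effect the abstract attributes to exploration and recency.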

Bibliographic details

Journal: Scientific Reports
Authors: Carolina Feher da Silva; Camila Gomes Victorino; Nestor Caticha; Marcus Vinícius Chrysóstomo Baldo
Year: 2017
Volume: 7
Page: 15326
Total pages: 23
Original format: PDF

Similar literature

1. Exploration and recency as the main proximate causes of probability matching: a reinforcement learning analysis [J]. Carolina Feher da Silva, Camila Gomes Victorino, Nestor Caticha. Scientific Reports. 2017, No. 1
2. Probability matching and reinforcement learning [J]. Rivas J. Journal of Mathematical Economics. 2013, No. 1
3. A Reinforcement Learning Method Using a Dynamic Reinforcement Function Based on Action Selection Probability [J]. Yugo Hasegawa, Satoko Takada, Hidehiro Nakano. Systems and Computers in Japan. 2007, No. 7
4. Recency-Weighted Acceleration for Continuous Control Through Deep Reinforcement Learning [C]. Zhen Wu, Zongzhang Zhang, Xiaofang Zhang. International Conference on Neural Information Processing. 2020
5. Youths' motivated attention and reinforcement matching characteristics as moderators of the effects of parenting on adolescent communication and externalizing behaviors: A social learning, matching law approach [D]. Goodnight, Jackson A. 2010
6. Probability learning as a function of momentary reinforcement probability [O]. Ben A. Williams. 1972
7. Exploration and recency as the main proximate causes of probability matching: a reinforcement learning analysis [O]. Carolina Feher da Silva, Camila Gomes Victorino, Nestor Caticha. 2017
