Probability matching and reinforcement learning

Rivas J.

Abstract

Probability matching occurs when an action is chosen with a frequency equal to the probability of that action being the best choice. This sub-optimal behavior has been reported repeatedly by psychologists and experimental economists. We provide an evolutionary foundation for this phenomenon by showing that learning by reinforcement can lead to probability matching and that, if the learning occurs sufficiently slowly, probability matching occurs not only in choice frequencies but also in choice probabilities. We complete our results by proving that there exists no quasi-linear reinforcement learning specification under which behavior is optimal for all environments where counterfactuals are observed.
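
The sub-optimality described in the abstract can be illustrated with a small sketch. If action A is the best choice with probability q, a probability matcher picks the best action with probability q^2 + (1 - q)^2, while always choosing the more likely action succeeds with probability max(q, 1 - q); for q = 0.7 that is 0.58 against 0.7. The code below also simulates a simple linear update rule in which the choice probability moves a small step toward whichever action turned out best. This rule, the function names, and the parameters `step`, `periods`, and `seed` are assumptions made for illustration only; they are not the specification analyzed in the paper.

```python
import random


def matching_accuracy(q):
    """Chance of picking the best action when A is chosen with probability q."""
    return q * q + (1 - q) * (1 - q)


def maximizing_accuracy(q):
    """Chance of picking the best action when the more likely one is always chosen."""
    return max(q, 1 - q)


def simulate(q, step=0.01, periods=20000, seed=0):
    """Hypothetical linear update, not the paper's model.

    Assumes the learner observes after every period which action was best
    (counterfactuals are observed). Returns the average probability of
    choosing A over the second half of the run.
    """
    rng = random.Random(seed)
    p = 0.5  # initial probability of choosing action A
    total = 0.0
    count = 0
    for t in range(periods):
        a_is_best = rng.random() < q
        target = 1.0 if a_is_best else 0.0
        p += step * (target - p)  # small step toward the action that was best
        if t >= periods // 2:
            total += p
            count += 1
    return total / count
```

With a small `step`, the average choice probability in this sketch settles near q rather than near 1, which is the matching pattern; calling `simulate(0.7)` and comparing the result with `matching_accuracy(0.7)` and `maximizing_accuracy(0.7)` shows the gap.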

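Bibliographic details

Source: Journal of Mathematical Economics, 2013, Issue 1, 5 pages
Author: Rivas J.
Original format: PDF
Language: English
Chinese Library Classification: Economic computation; mathematical methods in economics
Keywords: Probability matching; Reinforcement learning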
