...
首页> 外文期刊>Journal of Mathematical Economics >Probability matching and reinforcement learning
【24h】

Probability matching and reinforcement learning

机译:概率匹配和强化学习

获取原文
获取原文并翻译 | 示例
           

摘要

Probability matching occurs when an action is chosen with a frequency equivalent to the probability of that action being the best choice. This sub-optimal behavior has been reported repeatedly by psychologists and experimental economists. We provide an evolutionary foundation for this phenomenon by showing that learning by reinforcement can lead to probability matching and, if the learning occurs sufficiently slowly, probability matching does not only occur in choice frequencies but also in choice probabilities. Our results are completed by proving that there exists no quasi-linear reinforcement learning specification such that the behavior is optimal for all environments where counterfactuals are observed.
机译:当选择一个动作的频率等于该动作是最佳选择的概率时,就会发生概率匹配。心理学家和实验经济学家已经多次报告过这种次优的行为。我们通过证明强化学习可以导致概率匹配,从而为这种现象提供了进化基础。如果学习足够缓慢,则概率匹配不仅会出现在选择频率上,而且还会出现在选择概率上。通过证明不存在准线性强化学习规范来完成我们的结果,这样的行为对于观察到反事实的所有环境都是最佳的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号