首页> 外文会议>IEEE 5th International Bio-Inspired Computing: Theories and Applications >A dynamical policy search model for matching law

【24h】

A dynamical policy search model for matching law

机译：匹配律的动态策略搜索模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The matching law states that the fraction of choices made to any option will match the fraction of total rewards earned from that option. However, the income earned from conducting the matching behavior does not imply that it will get the optimal reward. It is unclear why subjects frequently exhibit the matching behavior rather than the optimal behavior. In this study, on the basis of the policy search model in reinforcement learning, an optimal algorithm is proposed, and the policy algorithm leading to matching law is derived from the optimal algorithm. Theoretical analysis and simulation results show that the decision behavior achieved by our algorithm is able to reach matching law in many kinds of reward schedules. Our results indicate that matching law can be exhibited whenever the subject tries to maximize a value function under a simple assumption that past choice behavior does not care about the values of future long-run reward. This results unveil the relationships between the matching behavior and the algorithm of optimal policy search.

机译：匹配法则指出，对任何期权做出的选择的比例将与从该期权获得的总报酬的比例相匹配。但是，通过进行匹配行为获得的收入并不意味着它将获得最佳回报。目前尚不清楚为什么受试者经常表现出匹配行为而不是最佳行为。本文在强化学习策略搜索模型的基础上，提出了一种优化算法，并从该算法中得出了导致匹配律的策略算法。理论分析和仿真结果表明，我们的算法实现的决策行为能够在多种奖励计划中达到匹配律。我们的研究结果表明，只要受试者在过去的选择行为并不关心未来长期奖励的价值的简单假设下，只要试图最大化价值功能，就可以展示出匹配法则。该结果揭示了匹配行为与最佳策略搜索算法之间的关系。

著录项

来源
《IEEE 5th International Bio-Inspired Computing: Theories and Applications 》|2010年|P.127-131|共5页
会议地点
作者
Cheng Zhenbo; Deng Zhidong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类理论、方法 ;
关键词

相似文献

外文文献
中文文献
专利

1. 动态形状模型中最优搜索空间和最优搜索过程 [J] . 何良华, 邹采荣, 赵力, 东南大学学报（英文版） . 2005 ,第003期
2. Can the baseline search and matching model quantitatively explain Okun's law? [J] . Cheuk Yin Ho Applied Economics . 2014 ,第31a33期

机译：基线搜索和匹配模型可以定量地解释奥肯定律吗？
3. Optimal fiscal policy in a model with search-and-matching frictions: the case of Bulgaria (1999-2018) [J] . Vasilev Aleksandar Post-Communist Economies . 2021 ,第4期

机译：搜索和匹配摩擦模型中的最佳财政政策：保加利亚的情况（1999-2018）
4. Efficiency in a search and matching model with participation policy [J] . Masters Adrian Economics letters . 2015 ,第sepa期

机译：具有参与政策的搜索和匹配模型的效率
5. A dynamical policy search model for matching law [C] . Cheng Zhenbo, Deng Zhidong International Conference on Bio-Inspired Computing: Theories and Applications . 2010

机译：匹配法的动态政策搜索模型
6. Essays on Multidimensional Search and Matching Models [D] . Safak, Veli 2019

机译：关于多维搜索和匹配模型的论文
7. Dynamic partitioning of search patterns for approximate pattern matching using search schemes [O] . Luca Renders, Kathleen Marchal, Jan Fostier 2021

机译：使用搜索方案进行近似模式匹配的搜索模式的动态分区
8. Describing the Dynamics of Distribution in Search and Matching Models by Fokker-Planck Equations [O] . Wue4lde Klaus, Bayer Christian 2011

机译：用Fokker-planck方程描述搜索和匹配模型中的分布动力学
9. Dynamic Modeling of Starting Aerodynamics and Stage Matching in an Axi-Centrifugal Compressor [R] . Wilkes, Kevin, OBrien, Walter F., Owen, A. Karl 1996

机译：axi离心压缩机启动空气动力学和阶段匹配的动态建模

A dynamical policy search model for matching law

摘要

著录项

相似文献

相关主题

期刊订阅