The economics analysis of a Q-learning model of cooperation with punishment and risk taking preferences

Solferino Nazaria; Solferino Viviana; Taurino Serena F.

Abstract

The aim of this paper is to better understand how cooperation mechanisms work in the context of a Q-learning model. We apply a reinforcement learning model to analyse the conditions needed for a stable cooperative equilibrium when people take part in a common project and could take advantage of free-riding. Our results show that a stable equilibrium can be reached thanks to punishment mechanisms, but the final result strongly depends on individuals' risk-taking preferences. In particular, we find that penalties are effective only for people with high exploration rates, namely people able to adapt their strategies and learn to cooperate. Otherwise, it is possible to have an unstable equilibrium with cooperation until individuals have a very high intrinsic motivation to cooperate, whatever the others do.
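The abstract does not give the model's equations, so the following is only a minimal sketch of the general technique it names: stateless Q-learning agents in a public-goods game where free-riders can be fined. It is not the authors' model. The payoff form, the parameter values (`benefit`, `cost`, `fine`, `alpha`, `epsilon`) and all function names are assumptions chosen for illustration, and the exploration rate `epsilon` is used here as a loose stand-in for the "exploration rates" mentioned above.

```python
import random

COOPERATE, DEFECT = 0, 1

def payoff(action, others_cooperating, n_players, benefit=3.0, cost=1.0, fine=0.0):
    """Public-goods payoff for one player (illustrative form, not from the paper)."""
    contributors = others_cooperating + (1 if action == COOPERATE else 0)
    share = benefit * contributors / n_players  # everyone shares the common project
    if action == COOPERATE:
        return share - cost   # contributors pay the cost of contributing
    return share - fine       # free-riders avoid the cost but pay the fine

def choose(q, epsilon, rng):
    """Epsilon-greedy choice: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.choice((COOPERATE, DEFECT))
    return COOPERATE if q[COOPERATE] >= q[DEFECT] else DEFECT

def update(q, action, reward, alpha=0.1):
    """Stateless Q-learning update: move the estimate toward the observed reward."""
    q[action] += alpha * (reward - q[action])

def simulate(n_players=4, rounds=2000, epsilon=0.1, fine=0.0, seed=0):
    """Return the average share of cooperative choices over all rounds."""
    rng = random.Random(seed)
    qs = [[0.0, 0.0] for _ in range(n_players)]  # one Q-table per player
    coop_count = 0
    for _ in range(rounds):
        actions = [choose(q, epsilon, rng) for q in qs]
        total_coop = actions.count(COOPERATE)
        for q, a in zip(qs, actions):
            others = total_coop - (1 if a == COOPERATE else 0)
            update(q, a, payoff(a, others, n_players, fine=fine))
        coop_count += total_coop
    return coop_count / (rounds * n_players)

if __name__ == "__main__":
    for fine in (0.0, 1.0):
        for epsilon in (0.01, 0.2):
            rate = simulate(epsilon=epsilon, fine=fine)
            print(f"fine={fine} epsilon={epsilon} cooperation rate={rate:.2f}")
```

With these assumed numbers, free-riding pays more than contributing when `fine=0.0`, and less once `fine=1.0`, so the fine changes which action a learner should come to prefer. How quickly, and whether, a learner discovers this depends on how often it explores.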


Bibliographic details

Source
Journal of Economic Interaction and Coordination, 2018, Issue 3, pp. 601-613 (13 pages)
Authors
Solferino Nazaria; Solferino Viviana; Taurino Serena F.
Author affiliations

Univ Roma Tor Vergata, Econ Dept, Rome, Italy;

Univ Calabria, Math & Comp Sci Dept, Arcavacata Di Rende, Italy;

Original format
PDF
Language of text
English
Keywords
Cooperation; Punishment; Q-learning models; Risk preferences

Similar literature

1. The economics analysis of a Q-learning model of cooperation with punishment and risk taking preferences [J]. Solferino Nazaria, Solferino Viviana, Taurino Serena F. Journal of land use science. 2018, Issue 1a3

2. An extended reinforcement learning model of basal ganglia to understand the contributions of serotonin and dopamine in risk-based decision making, reward prediction, and punishment learning [J]. Balasubramani, Pragathi P. Frontiers in Computational Neuroscience. 2014, Issue 4

3. Impact of Economics Learning on Risk Preferences and Rationality: An Empirical Investigation [J]. Zhengyi Zhou. The American Economist. 2013, Issue 1

4. The utility model analysis of “six-party cooperation + insurance” mode — Based on empirical research in insurance mode of pig-farming cooperation economic organizations [C]. Jing Tan, Shangwu Wang. 2011 International Conference on Computer Science and Service System. 2011

5. Aneurysm Rupture Risk Analysis and Risk Prediction Modeling Based on CFD Simulations and Statistical Learning [D]. Detmer, Felicitas Josephine. 2019

6. An extended reinforcement learning model of basal ganglia to understand the contributions of serotonin and dopamine in risk-based decision making reward prediction and punishment learning [O]. Pragathi P. Balasubramani, V. Srinivasa Chakravarthy, Balaraman Ravindran, 2014

7. An Extended Reinforcement Learning Model of Basal Ganglia to Understand the Contributions of Serotonin and Dopamine in Risk-Based Decision Making, Reward Prediction, and Punishment Learning [O]. Pragathi Priyadharsini Balasubramani, Srinivasa Chakravarthy, Ravindran Balaraman, 2014
