首页> 外文会议>International Conference on Autonomous Agents and Multiagent Systems >The Education of a Crook: Reinforcement Learning in Social-Cultural Settings
【24h】

The Education of a Crook: Reinforcement Learning in Social-Cultural Settings

机译:骗子教育:社会文化环境中的加固学习

获取原文

摘要

The ability to manipulate social and cultural values in order to achieve one's own goals is a hard-to-teach but profitable skill. In this paper we represent a complex social scenario, the Spanish Steps flower selling scam, using a social calculus framework based on culture sanctioned social metrics (CSSMs) and concrete beliefs (CBs). Then, we show how a crooked seller can learn a profitable strategy through reinforcement learning. Although the search space defined by the social calculus is large, we found that function approximation based Q-learning allows us to successfully learn efficient strategies in a relatively small number of runs. The learned strategy allows the seller to manipulate an unprepared tourist's social values of politeness and dignity, as well as his perception of the peers and crowds opinion. This allows the seller to manipulate some of his opponents to act against their own interests by purchasing an overpriced flower while well-knowing that they are being cheated.
机译:操纵社会和文化价值的能力为了实现自己的目标是难以教导但有利可图的技能。在本文中,我们代表了一个复杂的社会场景,西班牙语步骤销售骗局,利用基于文化的社会指标(CSSMS)和具体信仰(CBS)的社会微积分框架。然后,我们展示了弯曲的卖家如何通过加强学习来学习盈利战略。虽然由社会微积分定义的搜索空间很大,但我们发现基于函数近似的Q-Learning允许我们在相对少量的运行中成功学习高效的策略。学习策略使卖方能够操纵毫无准备的旅游人的礼貌和尊严的社会价值,以及他对同龄人和人群意见的看法。这使得卖方可以通过购买价格过高的花卉,操纵他的一些对手来违背自己的利益,同时知道他们被欺骗。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号