首页> 外文会议>2010 Second Brazilian Workshop on Social Simulation: Advances in Social Simulation >On the Problem of Recognizing and Learning Observable Social Exchange Strategies in Open Societies
【24h】

On the Problem of Recognizing and Learning Observable Social Exchange Strategies in Open Societies

机译:开放社会中认识和学习可观察的社会交往策略问题

获取原文

摘要

Regulation of social exchanges refers to controlling social exchanges between agents so that the balance of exchange values involved in the exchanges are continuously kept - as far as possible - near to equilibrium. Previous work modeled the social exchange regulation problem as a POMDP, and defined the policy To BDI plans algorithm to extract BDI plans from POMDP models, so that the derived BDI plans can be applied to keep in equilibrium social exchanges performed by BDI agents. The aim of this paper is to extend that BDI-POMDP agent model for the self-regulation of social exchanges with a HMM-based module for recognizing and learning partner agents' social exchange strategies, thus extending its applicability to open societies, where new partner agents can freely appear at any time. For the recognition problem, the BDI-POMDP-HMM agent proceeds by analyzing the patterns of refusals for exchange proposals that are present in a partner agent's behavior. For the learning problem, it learns HMM to capture probabilistic state transition and observation functions that model the social exchange strategy of the partner agent. The agent then transforms the HMM's transition and observation functions into POMDP's action-based state transition and observation functions, obtaining a POMDP model of the partner's previously unknown social exchange strategy, and deriving corresponding exchange regulation plans through policy To BDI plans. The paper also presents a discussion of the results of some simulations.
机译:调节社会交流是指控制代理人之间的社会交流,以使交流所涉及的交换价值的平衡不断地保持(尽可能)接近平衡。先前的工作将社会交换监管问题建模为POMDP,并定义了“ BDI计划”策略算法以从POMDP模型中提取BDI计划,以便可以将派生的BDI计划用于保持BDI代理执行的均衡社会交换。本文的目的是通过基于HMM的模块来扩展BDI-POMDP代理模型,以进行社会交流的自我调节,该模块用于识别和学习伙伴代理的社会交换策略,从而将其适用性扩展到新伙伴在开放社会中的适用性。代理商可以随时自由出现。对于识别问题,BDI-POMDP-HMM代理通过分析伙伴代理的行为中存在的交换建议的拒绝模式来进行处理。对于学习问题,它学习HMM来捕获建模伙伴代理人社交策略的概率状态转换和观察功能。然后,代理将HMM的转换和观察功能转换为POMDP基于动作的状态转换和观察功能,获得合作伙伴以前未知的社会交换策略的POMDP模型,并通过从策略到BDI计划的推导得出相应的交换监管计划。本文还提出了一些模拟结果的讨论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号