首页> 外文会议>Brazilian Workshop on Social Simulation >On the Problem of Recognizing and Learning Observable Social Exchange Strategies in Open Societies
【24h】

On the Problem of Recognizing and Learning Observable Social Exchange Strategies in Open Societies

机译:论公开社会中认识与学习可观察社会交换策略的问题

获取原文

摘要

Regulation of social exchanges refers to controlling social exchanges between agents so that the balance of exchange values involved in the exchanges are continuously kept - as far as possible - near to equilibrium. Previous work modeled the social exchange regulation problem as a POMDP, and defined the policy To BDI plans algorithm to extract BDI plans from POMDP models, so that the derived BDI plans can be applied to keep in equilibrium social exchanges performed by BDI agents. The aim of this paper is to extend that BDI-POMDP agent model for the self-regulation of social exchanges with a HMM-based module for recognizing and learning partner agents' social exchange strategies, thus extending its applicability to open societies, where new partner agents can freely appear at any time. For the recognition problem, the BDI-POMDP-HMM agent proceeds by analyzing the patterns of refusals for exchange proposals that are present in a partner agent's behavior. For the learning problem, it learns HMM to capture probabilistic state transition and observation functions that model the social exchange strategy of the partner agent. The agent then transforms the HMM's transition and observation functions into POMDP's action-based state transition and observation functions, obtaining a POMDP model of the partner's previously unknown social exchange strategy, and deriving corresponding exchange regulation plans through policy To BDI plans. The paper also presents a discussion of the results of some simulations.
机译:社会交易所的监管是指控制代理人之间的社会交易所,以便持续持续 - 尽可能持续地保持交流的汇率余额 - 靠近均衡。以前的工作建模为社会交流监管问题作为POMDP,并将策略定义为BDI计划算法从POMDP模型中提取BDI计划,以便可应用于BDI代理商执行的均衡社交交换。本文的目的是将BDI-POMDP代理模型扩展了社会交易所的社会交易所的自我调节,以认识到伴随和学习合作伙伴代理商的社会交换策略,从而将其适用于开放社会,在新的伴侣代理商可以随时自由出现。对于识别问题,BDI-POMDP-HMM代理通过分析伙伴代理人行为中存在的交换建议的拒绝模式进行进行。对于学习问题,它会学习嗯,以捕获概率的状态转换和观察功能,这些功能模拟伙伴代理人的社会交换策略。然后,代理将嗯的过渡和观察功能转变为POMDP的基于行动的状态转换和观察功能,从而获得了合作伙伴以前未知的社会交换策略的POMDP模型,并通过对BDI计划的政策导出相应的交换规则计划。本文还讨论了一些模拟结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号