首页> 外文期刊>Journal of the Brazilian Computer Society >Recognizing and learning models of social exchange strategies for the regulation of social interactions in open agent societies
【24h】

Recognizing and learning models of social exchange strategies for the regulation of social interactions in open agent societies

机译:开放交流社会中调节社交互动的社交交流策略的认知和学习模型

获取原文
获取原文并翻译 | 示例
       

摘要

Regulation of social exchanges refers to controlling social exchanges between agents so that the balance of exchange values involved in the exchanges are continuously kept—as far as possible—near to equilibrium. Previous work modeled the social exchange regulation problem as a POMDP (Partially Observable Markov Decision Process), and defined the policyToBDIplans algorithm to extract BDI (Beliefs, Desires, Intentions) plans from POMDP models, so that the derived BDI plans can be applied to keep in equilibrium social exchanges performed by BDI agents. The aim of the present paper is to extend that BDI-POMDP agent model for self-regulation of social exchanges with a module, based on HMM (Hidden Markov Model), for recognizing and learning partner agents’ social exchange strategies, thus extending its applicability to open societies, where new partner agents can freely appear at any time. For the recognition problem, patterns of refusals of exchange proposals are analyzed, as such refusals are produced by the partner agents. For the learning problem, HMMs are used to capture probabilistic state transition and observation functions that model the social exchange strategy of the partner agent, in order to translate them into POMDP’s action-based state transition and observation functions. The paper formally addresses the problem of translating HMMs into POMDP models and vice versa, introducing the translation algorithms and some examples. A discussion on the results of simulations of strategy-based social exchanges is presented, together with an analysis about related work on social exchanges in multiagent systems.
机译:调节社会交流是指控制代理人之间的社会交流,以便使交流所涉及的交换价值平衡不断(尽可能)保持接近平衡。先前的工作将社会交换监管问题建模为POMDP(部分可观察的马尔可夫决策过程),并定义了policyToBDIplans算法以从POMDP模型中提取BDI(信念,愿望,意图)计划,以便可以将派生的BDI计划用于保持在BDI代理人进行的均衡社会交流中本文的目的是通过基于HMM(隐马尔可夫模型)的模块扩展BDI-POMDP代理模型,以实现社会交流的自我调节,从而识别和学习伙伴代理的社会交换策略,从而扩展其适用性。开放的社会,新的合作伙伴代理商可以随时自由出现。对于识别问题,将分析拒绝交换提议的方式,因为这种拒绝是由伙伴代理产生的。对于学习问题,HMM用于捕获对伙伴代理的社会交换策略建模的概率状态转换和观察功能,以便将其转换为POMDP基于动作的状态转换和观察功能。本文正式解决了将HMM转换为POMDP模型的问题,反之亦然,介绍了转换算法和一些示例。讨论了基于策略的社交交流的模拟结果,并对有关多主体系统中社交交流的相关工作进行了分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号