首页> 外文会议>International Joint Conference on Autonomous Agents and Multiagent Systems >MB-AIM-FSI: A Model Based Framework for exploiting gradient ascent MultiAgent Learners in Strategic Interactions
【24h】

MB-AIM-FSI: A Model Based Framework for exploiting gradient ascent MultiAgent Learners in Strategic Interactions

机译:MB-AIM-FSI:基于模型的框架,用于利用战略互动中的渐变上升多书学习者

获取原文

摘要

Future agent applications will increasingly represent human users autonomously or semi-autonomously in strategic interactions with similar entities. Hence, there is a growing need to develop algorithmic approaches that can learn to recognize commonalities in opponent strategies and exploit such commonalities to improve strategic response. Recently a framework [9] has been proposed that aims for targeted optimality against a set of finite memory opponents. We propose an approach that aims for targeted optimality against the set of all possible multiagent learning algorithms that perform gradient search to select a single stage Nash Equilibria of a repeated game. Such opponents induce a Markov Decision Process as the learning environment and appropriate responses to such environments are learned by assuming a generative model of the environment. In the absence of a generative model, we present a framework, MB-AIM-FSI, that models the opponent online based on interactions, solves the model off-line when sufficient information has been gathered, stores the strategy in the repository and finally uses it judiciously when playing against the same or similar opponent at a later time.
机译:未来代理申请将越来越多地代表人类用户在与类似实体的战略互动中自主或半自动。因此,越来越需要开发算法方法,这些方法可以学会识别对手战略中的共性,并利用这种共性来提高战略反应。最近,已经提出了一个框架[9],其旨在针对一组有限记忆对手的目标最优性。我们提出了一种方法,该方法旨在针对针对执行梯度搜索的所有可能的多学习算法的集合来选择重复游戏的单级NASH均衡。这种对手诱导马尔可夫决定过程作为学习环境,并通过假设环境的生成模型来学习对这种环境的适当响应。在没有生成模型的情况下,我们介绍了一个框架MB-AIM-FSI,即根据交互在线在线模型,在收集足够的信息时解决了模型的离线,将策略存储在存储库中并最终使用在以后在同一或相似的对手上玩耍时明智地。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号