...
首页> 外文期刊>Annals of Mathematics and Artificial Intelligence >A unifying learning framework for building artificial game-playing agents
【24h】

A unifying learning framework for building artificial game-playing agents

机译:建立人工游戏代理商的统一学习框架

获取原文
获取原文并翻译 | 示例
           

摘要

This paper investigates learning-based agents that are capable of mimicking human behavior in game playing, a central task in computational economics. Although computational economists have developed various game-playing agents, well-established machine learning methods such as graphical models have not been applied before. Leveraging probabilistic graphical models, this paper presents a novel sequential Bayesian network (SBN) framework for building artificial game-playing agents. We show that many existing agents, including reinforcement learning, fictitious play, and many of their variants, have a unified Bayesian explanation within the proposed SBN framework. Moreover, we discover that SBN can handle various important settings of game playing, allowing for a broad scope of its use in economics. SBN not only provides a unifying and satisfying framework to explain existing learning approaches in virtual economies, but also enables the development of new algorithms that are stronger or have fewer restrictions. In this paper, we derive a new algorithm, Hidden Markovian Play (HMP), from the generic SBN model to handle an important but difficult setting in which a player cannot observe the opponent's strategy and payoff. It leverages Markovian learning to infer unobservable information, leading to higher quality of the agents. Experiments on real-world field experiments in evaluating economies show that our HMP model outperforms the baseline algorithms for building artificial agents.
机译:本文研究了能够模仿游戏中人类行为的基于学习的主体,这是计算经济学中的核心任务。尽管计算经济学家已经开发了各种游戏代理,但之前尚未应用成熟的机器学习方法(例如图形模型)。利用概率图形模型,本文提出了一种新颖的顺序贝叶斯网络(SBN)框架,用于构建人工游戏代理。我们表明,许多现有代理(包括强化学习,虚拟游戏及其许多变体)在建议的SBN框架内具有统一的贝叶斯解释。此外,我们发现SBN可以处理游戏的各种重要设置,从而使其在经济学中具有广泛的用途。 SBN不仅提供一个统一且令人满意的框架来解释虚拟经济中的现有学习方法,而且还可以开发更强大或限制更少的新算法。在本文中,我们从通用SBN模型中衍生出一种新算法,即隐马尔可夫游戏(HMP),以应对玩家无法观察对手的策略和收益的重要但困难的设置。它利用马尔可夫学习来推断无法观察的信息,从而提高了代理的质量。在评估经济性的现实世界中进行的现场实验表明,我们的HMP模型优于构建人工代理的基准算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号