A unifying learning framework for building artificial game-playing agents

Chen Wenlin; Chen Yixin; Levine David K.

首页> 外文期刊>Annals of Mathematics and Artificial Intelligence >A unifying learning framework for building artificial game-playing agents

【24h】

A unifying learning framework for building artificial game-playing agents

机译：建立人工游戏代理商的统一学习框架

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper investigates learning-based agents that are capable of mimicking human behavior in game playing, a central task in computational economics. Although computational economists have developed various game-playing agents, well-established machine learning methods such as graphical models have not been applied before. Leveraging probabilistic graphical models, this paper presents a novel sequential Bayesian network (SBN) framework for building artificial game-playing agents. We show that many existing agents, including reinforcement learning, fictitious play, and many of their variants, have a unified Bayesian explanation within the proposed SBN framework. Moreover, we discover that SBN can handle various important settings of game playing, allowing for a broad scope of its use in economics. SBN not only provides a unifying and satisfying framework to explain existing learning approaches in virtual economies, but also enables the development of new algorithms that are stronger or have fewer restrictions. In this paper, we derive a new algorithm, Hidden Markovian Play (HMP), from the generic SBN model to handle an important but difficult setting in which a player cannot observe the opponent's strategy and payoff. It leverages Markovian learning to infer unobservable information, leading to higher quality of the agents. Experiments on real-world field experiments in evaluating economies show that our HMP model outperforms the baseline algorithms for building artificial agents.

机译：本文研究了能够模仿游戏中人类行为的基于学习的主体，这是计算经济学中的核心任务。尽管计算经济学家已经开发了各种游戏代理，但之前尚未应用成熟的机器学习方法（例如图形模型）。利用概率图形模型，本文提出了一种新颖的顺序贝叶斯网络（SBN）框架，用于构建人工游戏代理。我们表明，许多现有代理（包括强化学习，虚拟游戏及其许多变体）在建议的SBN框架内具有统一的贝叶斯解释。此外，我们发现SBN可以处理游戏的各种重要设置，从而使其在经济学中具有广泛的用途。 SBN不仅提供一个统一且令人满意的框架来解释虚拟经济中的现有学习方法，而且还可以开发更强大或限制更少的新算法。在本文中，我们从通用SBN模型中衍生出一种新算法，即隐马尔可夫游戏（HMP），以应对玩家无法观察对手的策略和收益的重要但困难的设置。它利用马尔可夫学习来推断无法观察的信息，从而提高了代理的质量。在评估经济性的现实世界中进行的现场实验表明，我们的HMP模型优于构建人工代理的基准算法。

著录项

来源
《Annals of Mathematics and Artificial Intelligence》 |2015年第4期|335-358|共24页
作者
Chen Wenlin; Chen Yixin; Levine David K.;
展开▼
作者单位

Washington Univ, St Louis, MO 63130 USA;

Washington Univ, St Louis, MO 63130 USA;

Washington Univ, St Louis, MO 63130 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Behavioral game theory; Graphical model; Multi-agent learning; Fictitious play; Hidden Markov model;

机译：行为博弈理论;图形模型;多主体学习;虚拟游戏;隐马尔可夫模型;

相似文献

外文文献
中文文献
专利

1. A unified framework for reinforcement learning, co-learning and meta-learning how to coordinate in collaborative multi-agent systems [J] . Predrag T. To?i?, Ricardo Vilalta Procedia Computer Science . 2010,第1期

机译：强化学习，共同学习和元学习的统一框架，如何在协作式多智能体系统中进行协调
2. Synthetic learning agents in game-playing social environments [J] . Kiourt Chairi, Kalles Dimitris Adaptive Behavior . 2016,第6期

机译：游戏性社交环境中的综合学习代理
3. An Intelligent Agent based Novel Framework for Building Management System using Artificial Intelligence [J] . Shabbab A Alhammadi International journal of computer science and network security . 2020,第1期

机译：基于智能代理的建筑物管理系统框架，用于使用人工智能建立管理系统
4. Learning Simulation Control in General Game-Playing Agents [C] . Hilmar Finnsson, Yngvi Bjornsson Innovative applications of artificial intelligence conference;AAAI conference on artificial intelligence;IAAI-10;Symposium on educational advances in artificial intelligence;AAAI-10;EAAI-10 . 2011

机译：通用游戏代理中的学习模拟控制
5. Building an artificial cerebellum using a system of distributed q-learning agents. [D] . Soto Santibanez, Miguel Angel. 2010

机译：使用分布式q学习剂系统构建人工小脑。
6. Representation Learning: A Unified Deep Learning Framework for Automatic Prostate MR Segmentation [O] . Shu Liao, Yaozong Gao, Aytekin Oto, -1

机译：代表学习：一个统一的深学习框架自动前列腺mR分割
7. A unified framework for reinforcement learning, co-learning and meta-learning how to coordinate in collaborative multi-agent systems [O] . Tošić Predrag T., Vilalta Ricardo 2010

机译：强化学习，共同学习和元学习的统一框架，如何在协作式多智能体系统中进行协调
8. Unified Behavior Framework for the Simulation of Autonomous Agents. [R] . Roberson, D. M. 2015

机译：自治agent仿真的统一行为框架。

A unifying learning framework for building artificial game-playing agents

摘要

著录项

相似文献

相关主题

期刊订阅