【24h】

Learning Strategies in Games by Anticipation

机译:预期游戏中的学习策略

获取原文

摘要

Game Theory is maninly seen as a mathematical theory which tries to replace pure chance and intuitive behavior in a competitive situation by calculations. This theory has been widely used to define computer programs. The aim of the research described here is to design an artificial system which is able to play efficiently certain games to which Game Theory cannot be applied satisfactorily (such as games with incomplete or imperfect information). When it cannot find a winning solution, the system is able to play through a process of anticipation. This is done by building and refining a model of the adversary's behavior in real time during the game. The architecture proposed here relies on two genetic classifiers, one of which models the adversaries' behaviors while the other uses the models thus built in order to play. The system's strategy learnign ability has been tested on a simple strategic game. The results show the advantages of this appraoch over human and traditional artificial adversaries (simple probabilistic and adaptive probabilistic) and illustrate how the system learns the strategies used by its aversaries.
机译:博弈论被广泛认为是一种数学理论,试图通过计算来代替竞争情况下的纯粹机会和直觉行为。该理论已被广泛用于定义计算机程序。本文所述研究的目的是设计一种人造系统,该系统能够有效地玩某些不能令人满意地应用博弈论的游戏(例如信息不完整或不完美的游戏)。当它找不到成功的解决方案时,系统便可以进行预期的工作。这是通过在游戏过程中实时建立和完善对手行为的模型来完成的。这里提出的体系结构依赖于两个遗传分类器,其中一个对对手的行为进行建模,而另一个则使用由此构建的模型进行游戏。该系统的策略学习能力已在一个简单的策略游戏中进行了测试。结果表明,该方法优于人类和传统的人工对手(简单概率和自适应概率),并说明了系统如何学习其对手使用的策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号