首页> 外文会议>International Conference on Autonomous and Intelligent Systems >Adapting Strategies to Opponent Models in Incomplete Information Games: A Reinforcement Learning Approach for Poker
【24h】

Adapting Strategies to Opponent Models in Incomplete Information Games: A Reinforcement Learning Approach for Poker

机译:在不完整信息游戏中调整对手模型的策略:扑克的加强学习方法

获取原文

摘要

Researching into the incomplete information games (IIG) field requires the development of strategies which focus on optimizing the decision making process, as there is no unequivocal best choice for a particular play. As such, this paper describes the development process and testing of an agent able to compete against human players on Poker - one of the most popular IIG. The used methodology combines pre-defined opponent models with a reinforcement learning approach. The decision-making algorithm creates a different strategy against each type of opponent by identifying the opponent's type and adjusting the rewards of the actions of the corresponding strategy. The opponent models are simple classifications used by Poker experts. Thus, each strategy is constantly adapted throughout the games, continuously improving the agent's performance. In light of this, two agents with the same structure but different rewarding conditions were developed and tested against other agents and each other. The test results indicated that after a training phase the developed strategy is capable of outperforming basic/intermediate playing strategies thus validating this approach.
机译:研究进入不完整的信息游戏(IIG)领域需要开发专注于优化决策过程的策略,因为特定戏剧没有明确的最佳选择。因此,本文介绍了能够与扑克上的人类参与者竞争的代理商的开发过程和测试 - 最受欢迎的IIG之一。使用的方法将预定义的对手模型与加强学习方法相结合。决策算法通过识别对手的类型和调整相应的战略行动的奖励创建针对每种类型的对手不同的策略。对手模型是扑克专家使用的简单分类。因此,每个策略在整个游戏中不断调整,不断提高代理商的表现。鉴于此,两种具有相同结构但具有不同益处条件的试剂并对其他药剂进行开发并彼此测试。测试结果表明,在培训阶段之后,发达的策略能够优化基本/中间竞争策略,从而验证这种方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号