首页> 外文会议>Autonomous and intelligent systems >Adapting Strategies to Opponent Models in Incomplete Information Games: A Reinforcement Learning Approach for Poker
【24h】

Adapting Strategies to Opponent Models in Incomplete Information Games: A Reinforcement Learning Approach for Poker

机译:不完全信息游戏中适应对手模型的策略:扑克的强化学习方法

获取原文
获取原文并翻译 | 示例

摘要

Researching into the incomplete information games (IIG) field requires the development of strategies which focus on optimizing the decision making process, as there is no unequivocal best choice for a particular play. As such, this paper describes the development process and testing of an agent able to compete against human players on Poker - one of the most popular IIG. The used methodology combines pre-defined opponent models with a reinforcement learning approach. The decision-making algorithm creates a different strategy against each type of opponent by identifying the opponent's type and adjusting the rewards of the actions of the corresponding strategy. The opponent models are simple classifications used by Poker experts. Thus, each strategy is constantly adapted throughout the games, continuously improving the agent's performance. In light of this, two agents with the same structure but different rewarding conditions were developed and tested against other agents and each other. The test results indicated that after a training phase the developed strategy is capable of outperforming basic/intermediate playing strategies thus validating this approach.
机译:对不完全信息游戏(IIG)领域的研究需要开发专注于优化决策过程的策略,因为对于特定游戏没有明确的最佳选择。因此,本文描述了一种能够与扑克上的人类玩家竞争的代理商的开发过程和测试-扑克是最受欢迎的IIG之一。使用的方法将预定义的对手模型与强化学习方法结合在一起。决策算法通过识别对手的类型并调整相应策略的动作奖励来针对每种类型的对手创建不同的策略。对手模型是扑克专家使用的简单分类。因此,在整个游戏过程中,每种策略都会不断调整,从而不断提高代理商的表现。有鉴于此,开发了具有相同结构但奖励条件不同的两个代理,并针对其他代理进行了测试。测试结果表明,在训练阶段之后,已开发的策略能够胜过基本/中级的游戏策略,从而验证了这种方法。

著录项

  • 来源
  • 会议地点 Aveiro(PT)
  • 作者单位

    LIACC - Artificial Intelligence and Computer Science Lab., University of Porto, Portugal,FEUP - Faculty of Engineering, University of Porto, DEI, Portugal;

    FEUP - Faculty of Engineering, University of Porto, DEI, Portugal;

    LIACC - Artificial Intelligence and Computer Science Lab., University of Porto, Portugal,EEUM - School of Engineering, University of Minho, DSI, Portugal;

    LIACC - Artificial Intelligence and Computer Science Lab., University of Porto, Portugal,FEUP - Faculty of Engineering, University of Porto, DEI, Portugal;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Incomplete Information Games; Opponent Modeling; Reinforcement Learning; Poker;

    机译:不完整的信息游戏;对手建模;强化学习;扑克;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号