首页> 外文会议>Autonomous and intelligent systems >Adapting Strategies to Opponent Models in Incomplete Information Games: A Reinforcement Learning Approach for Poker

【24h】

Adapting Strategies to Opponent Models in Incomplete Information Games: A Reinforcement Learning Approach for Poker

机译：不完全信息游戏中适应对手模型的策略：扑克的强化学习方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Researching into the incomplete information games (IIG) field requires the development of strategies which focus on optimizing the decision making process, as there is no unequivocal best choice for a particular play. As such, this paper describes the development process and testing of an agent able to compete against human players on Poker - one of the most popular IIG. The used methodology combines pre-defined opponent models with a reinforcement learning approach. The decision-making algorithm creates a different strategy against each type of opponent by identifying the opponent's type and adjusting the rewards of the actions of the corresponding strategy. The opponent models are simple classifications used by Poker experts. Thus, each strategy is constantly adapted throughout the games, continuously improving the agent's performance. In light of this, two agents with the same structure but different rewarding conditions were developed and tested against other agents and each other. The test results indicated that after a training phase the developed strategy is capable of outperforming basic/intermediate playing strategies thus validating this approach.

机译：对不完全信息游戏（IIG）领域的研究需要开发专注于优化决策过程的策略，因为对于特定游戏没有明确的最佳选择。因此，本文描述了一种能够与扑克上的人类玩家竞争的代理商的开发过程和测试-扑克是最受欢迎的IIG之一。使用的方法将预定义的对手模型与强化学习方法结合在一起。决策算法通过识别对手的类型并调整相应策略的动作奖励来针对每种类型的对手创建不同的策略。对手模型是扑克专家使用的简单分类。因此，在整个游戏过程中，每种策略都会不断调整，从而不断提高代理商的表现。有鉴于此，开发了具有相同结构但奖励条件不同的两个代理，并针对其他代理进行了测试。测试结果表明，在训练阶段之后，已开发的策略能够胜过基本/中级的游戏策略，从而验证了这种方法。

著录项

来源
《Autonomous and intelligent systems》|2012年|220-227|共8页
会议地点 Aveiro(PT)
作者
Luis Filipe Teofilo; Nuno Passos; Luis Paulo Reis; Henrique Lopes Cardoso;
展开▼
作者单位

LIACC - Artificial Intelligence and Computer Science Lab., University of Porto, Portugal,FEUP - Faculty of Engineering, University of Porto, DEI, Portugal;

FEUP - Faculty of Engineering, University of Porto, DEI, Portugal;

LIACC - Artificial Intelligence and Computer Science Lab., University of Porto, Portugal,EEUM - School of Engineering, University of Minho, DSI, Portugal;

LIACC - Artificial Intelligence and Computer Science Lab., University of Porto, Portugal,FEUP - Faculty of Engineering, University of Porto, DEI, Portugal;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Incomplete Information Games; Opponent Modeling; Reinforcement Learning; Poker;

机译：不完整的信息游戏；对手建模；强化学习；扑克;

相似文献

外文文献
中文文献
专利

1. An Adaptive Strategy Model for Opponent's Characteristics based on Reinforcement Learning [J] . Masahiro Ono, Mitsuru Shiozaki, Mamoru Sasaki, 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2003,第228期

机译：基于强化学习的对手特征自适应策略模型
2. An Adaptive Strategy Model for Opponent's Characteristics based on Reinforcement Learning [J] . Masahiro Ono, Mitsuru Shiozaki, Mamoru Sasaki, 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2003,第228期

机译：基于强化学习的对手特征自适应策略模型
3. Adapting attackers and defenders patrolling strategies: A reinforcement learning approach for Stackelberg security games [J] . Kristal K. Trejo, Julio B. Clempner, Alexander S. Poznyak Journal of computer and system sciences . 2018,第AUGa期

机译：调整攻击者和防御者的巡逻策略：Stackelberg安全游戏的强化学习方法
4. Adapting Strategies to Opponent Models in Incomplete Information Games: A Reinforcement Learning Approach for Poker [C] . Luis Filipe Teofilo, Nuno Passos, Luis Paulo Reis, International Conference on Autonomous and Intelligent Systems . 2012

机译：在不完整信息游戏中调整对手模型的策略：扑克的加强学习方法
5. Reinforcement learning in stochastic games against bounded memory opponents. [D] . Vrljicak, Tomislav. 2006

机译：针对随机记忆对手的随机游戏中的强化学习。
6. Modeling choice and reaction time during arbitrary visuomotor learning through the coordination of adaptive working memory and reinforcement learning [O] . Guillaume Viejo, Mehdi Khamassi, Andrea Brovelli, 2015

机译：通过自适应工作记忆和强化学习的协调对任意视觉运动学习中的选择和反应时间建模
7. Bayes-relational learning of opponent models from incomplete information in no-limit poker [O] . Ponsen Marc, Ramon Jan, Croonenborghs Tom, 2008

机译：从无限制扑克中的不完全信息中对手模型的贝叶斯关系学习

Adapting Strategies to Opponent Models in Incomplete Information Games: A Reinforcement Learning Approach for Poker

摘要

著录项

相似文献

相关主题

期刊订阅