

Adapting Strategies to Opponent Models in Incomplete Information Games: A Reinforcement Learning Approach for Poker


Abstract

Research in incomplete information games (IIG) requires strategies that optimize the decision-making process, since no single play is unequivocally best in a given situation. This paper describes the development and testing of an agent able to compete against human players at Poker, one of the most popular IIGs. The methodology combines predefined opponent models with a reinforcement learning approach. The decision-making algorithm builds a different strategy against each type of opponent by identifying the opponent's type and adjusting the rewards of the actions in the corresponding strategy. The opponent models are simple classifications used by Poker experts. Each strategy is thus adapted continuously throughout the games, improving the agent's performance. Two agents with the same structure but different rewarding conditions were developed and tested against other agents and against each other. The test results indicate that, after a training phase, the developed strategy can outperform basic and intermediate playing strategies, which validates the approach.
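
The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea it describes: keep one table of action rewards per opponent type, and adjust the table that matches the opponent currently identified. Everything specific here is an assumption made for illustration and is not taken from the paper: the tight/loose and passive/aggressive labels, the `vpip` and `aggression` statistics and their thresholds, the three-action set, the state names, and the epsilon-greedy incremental update.

```python
import random
from collections import defaultdict

# Hypothetical action set; the paper's actual action space is not given in the abstract.
ACTIONS = ("fold", "call", "raise")


def classify_opponent(vpip, aggression):
    """Map two observed statistics to a coarse opponent type.

    Thresholds are illustrative assumptions, not values from the paper.
    vpip: fraction of hands the opponent voluntarily plays (0..1).
    aggression: ratio of bets/raises to calls.
    """
    tightness = "tight" if vpip < 0.28 else "loose"
    style = "aggressive" if aggression > 1.0 else "passive"
    return f"{tightness}-{style}"


class PerOpponentLearner:
    """Keeps a separate table of action-reward estimates for each opponent type."""

    def __init__(self, learning_rate=0.1, epsilon=0.1, seed=None):
        self.learning_rate = learning_rate
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        # values[opponent_type][state][action] -> estimated reward
        self.values = defaultdict(
            lambda: defaultdict(lambda: dict.fromkeys(ACTIONS, 0.0))
        )

    def choose(self, opponent_type, state):
        """Epsilon-greedy choice using only the table for this opponent type."""
        table = self.values[opponent_type][state]
        if self.rng.random() < self.epsilon:
            return self.rng.choice(ACTIONS)
        return max(ACTIONS, key=table.get)

    def update(self, opponent_type, state, action, reward):
        """Move the estimate for (state, action) toward the observed reward."""
        table = self.values[opponent_type][state]
        table[action] += self.learning_rate * (reward - table[action])


if __name__ == "__main__":
    learner = PerOpponentLearner(epsilon=0.0, seed=0)
    opponent = classify_opponent(vpip=0.5, aggression=0.5)
    learner.update(opponent, "strong_hand", "raise", reward=10.0)
    print(opponent, learner.choose(opponent, "strong_hand"))
```

Because each opponent type has its own table, a reward observed against one type leaves the estimates used against every other type unchanged, which is one simple way to obtain a different strategy per opponent type.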


Bibliographic details

Source
International Conference on Autonomous and Intelligent Systems | 2012 | 8 pages
Conference location
Authors
Luis Filipe Teofilo; Nuno Passos; Luis Paulo Reis; Henrique Lopes Cardoso
Author affiliations
Conference organizer
Original format: PDF
Language of text
Chinese Library Classification: TP18-53
Keywords
Incomplete Information Games; Opponent Modeling; Reinforcement Learning; Poker


Similar literature


1. An Adaptive Strategy Model for Opponent's Characteristics based on Reinforcement Learning [J]. Masahiro Ono, Mitsuru Shiozaki, Mamoru Sasaki. 電子情報通信学会技術研究報告. ニューロコンピューティング (Neurocomputing). 2003, No. 228
2. Adapting attackers and defenders patrolling strategies: A reinforcement learning approach for Stackelberg security games [J]. Kristal K. Trejo, Julio B. Clempner, Alexander S. Poznyak. Journal of Computer and System Sciences. 2018, No. AUGa
3. Adapting Strategies to Opponent Models in Incomplete Information Games: A Reinforcement Learning Approach for Poker [C]. Luis Filipe Teofilo, Nuno Passos, Luis Paulo Reis. Autonomous and Intelligent Systems. 2012
4. Reinforcement learning in stochastic games against bounded memory opponents [D]. Vrljicak, Tomislav. 2006
5. Modeling choice and reaction time during arbitrary visuomotor learning through the coordination of adaptive working memory and reinforcement learning [O]. Guillaume Viejo, Mehdi Khamassi, Andrea Brovelli. 2015
6. Bayes-relational learning of opponent models from incomplete information in no-limit poker [O]. Ponsen Marc, Ramon Jan, Croonenborghs Tom. 2008
