Scalable sub-game solving for imperfect-information games

Abstract

Counterfactual regret minimization (CFR) is a popular and effective method for solving games with imperfect information. Its effectiveness is limited by the size of the game state space, which grows rapidly as the number of players increases. Although vanilla CFR is suitable for two-player imperfect-information games, it does not work well in imperfect-information games with three or more players. In this paper, we design a framework for imperfect-information games that not only handles two-player games but also efficiently solves three-player games. In contrast to traditional solving methods, the framework introduces real-time hand abstraction (RTHA), which reduces the error caused by abstraction. We also propose a warm-start online solution of sub-game (WSOS-SG) method, which improves the accuracy of action estimation and solves the sub-game in real time. Experimental results show that an agent based on our method achieves better performance than traditional methods. The agent took part in the 2018 AAAI-ACPC poker competition and won third place in heads-up no-limit Texas hold'em. © 2021 Elsevier B.V. All rights reserved.

Bibliographic record

  • Source
    Knowledge-Based Systems | 2021, Issue 14 | pp. 107434.1-107434.11 | 11 pages
  • Author affiliations

    Harbin Inst Technol Sch Comp Sci & Technol Shenzhen 518055 Peoples R China;

    Harbin Inst Technol Sch Comp Sci & Technol Shenzhen 518055 Peoples R China|Peng Cheng Lab Shenzhen 518038 Peoples R China;

    Harbin Inst Technol Sch Comp Sci & Technol Shenzhen 518055 Peoples R China;

    Harbin Inst Technol Sch Comp Sci & Technol Shenzhen 518055 Peoples R China;

    Harbin Inst Technol Sch Comp Sci & Technol Shenzhen 518055 Peoples R China;

    Harbin Inst Technol Sch Comp Sci & Technol Shenzhen 518055 Peoples R China;

    Harbin Inst Technol Sch Comp Sci & Technol Shenzhen 518055 Peoples R China|Peng Cheng Lab Shenzhen 518038 Peoples R China;

  • Indexing information
  • Original format: PDF
  • Text language: English
  • Chinese Library Classification
  • Keywords

    Game; Counterfactual regret minimization; Imperfect-information; Agent;
