Self-play reinforcement learning with comprehensive critic in computer games

Liu Shanqi; Cao Junjie; Wang Yujie; Chen Wenzhou; Liu Yong

首页> 外文期刊>Neurocomputing >Self-play reinforcement learning with comprehensive critic in computer games

【24h】

Self-play reinforcement learning with comprehensive critic in computer games

机译：在电脑游戏中的综合评论家自助增强学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Self-play reinforcement learning, where agents learn by playing with themselves, has been successfully applied in many game scenarios. However, the training procedure for self-play reinforcement learning is unstable and more sample-inefficient than (general) reinforcement learning, especially in imperfect information games. To improve the self-play training process, we incorporate a comprehensive critic into the policy gradient method to form a self-play actor-critic (SPAC) method for training agents to play com-puter games. We evaluate our method in four different environments in both competitive and coopera-tive tasks. The results show that the agent trained with our SPAC method outperforms those trained with deep deterministic policy gradient (DDPG) and proximal policy optimization (PPO) algorithms in many different evaluation approaches, which vindicate the effect of our comprehensive critic in the self-play training procedure. CO 2021 Elsevier B.V. All rights reserved.

机译：在许多游戏场景中成功地应用了自助增强学习，代理商学习，已经成功地应用了许多游戏场景。然而，自助增强学习的培训程序比（一般）加强学习更不稳定，更像是更高的样品效率，特别是在不完美的信息游戏中。为了提高自助培训流程，我们将全面的批评融入政策渐变方法，以形成自行运动员 - 评论家（SPAC）方法，用于培训代理商播放COM-PUTER Games。我们在竞争和合作社任务中评估了四种不同环境中的方法。结果表明，随着我们的SPAC方法培训的代理商优于许多不同评估方法中具有深度确定性政策梯度（DDPG）和近端政策优化（PPO）算法培训的代理人，这使我们在自助培训中的综合评论家的效果程序。 CO 2021 elestvier b.v.保留所有权利。

著录项

来源
《Neurocomputing》 |2021年第18期|207-213|共7页
作者
Liu Shanqi; Cao Junjie; Wang Yujie; Chen Wenzhou; Liu Yong;
展开▼
作者单位

Zhejiang Univ Inst Cyber Syst & Control Hangzhou Peoples R China;

Zhejiang Univ Inst Cyber Syst & Control Hangzhou Peoples R China;

Zhejiang Univ Inst Cyber Syst & Control Hangzhou Peoples R China;

Zhejiang Univ Inst Cyber Syst & Control Hangzhou Peoples R China;

Zhejiang Univ Inst Cyber Syst & Control Hangzhou Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Reinforcement learning; Self-play; Computer game;

机译：加固学习;自我扮演;电脑游戏;

相似文献

外文文献
中文文献
专利

1. Actor-Critic Reinforcement Learning and Application in Developing Computer-Vision-Based Interface Tracking [J] . Oguzhan Dogru, Kirubakaran Velswamy, Biao Huang 工程（英文） . 2021,第009期

机译：Actor-Critic Reinforcement Learning and Application in Developing Computer-Vision-Based Interface Tracking
2. Research on the Difficulty of Mobile Node Deployment’s Self-Play in Wireless Ad Hoc Networks Based on Deep Reinforcement Learning [J] . Huitao Wang, Ruopeng Yang, Changsheng Yin, Wireless communications & mobile computing . 2021,第a期

机译：基于深度加强学习的无线临时网络中移动节点部署自助难度的研究
3. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play [J] . Silver David, Hubert Thomas, Schrittwieser Julian, Science . 2018,第6419期

机译：一种通用的强化学习算法，可掌握国际象棋，将棋和自打法
4. Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play [C] . van der Ree Michiel, Wiering Marco IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning . 2013

机译：在《奥赛罗》游戏中的强化学习：对固定对手的学习和自学的学习
5. The comparison of individualized computer game reinforcement versus peer-interactive board game reinforcement on nutrition label knowledge retention of fifth graders. [D] . Grechus, Marilyn Lou. 1997

机译：比较个人计算机游戏强化和同伴互动棋盘游戏强化对五年级学生营养标签知识的保留。
6. Application of Improved Asynchronous Advantage Actor Critic Reinforcement Learning Model on Anomaly Detection [O] . Kun Zhou, Wenyong Wang, Teng Hu, 2021

机译：改进异步优势演员批评批评学习模型在异常检测中的应用
7. Affect and the computer game player: the effect of gender, personality, and game reinforcement structure on affective responses to computer game-play [O] . Chumbley, J, Griffiths, MD 2006

机译：情感和计算机游戏玩家：性别，个性和游戏强化结构对计算机游戏玩法的情感反应的影响

Self-play reinforcement learning with comprehensive critic in computer games

摘要

著录项

相似文献

相关主题

期刊订阅