Monte-Carlo Tree Search (MCTS) was recently proposed [1, 2, 3] for decision making in discrete-time control problems. It has been applied very efficiently to games [4, 5, 6, 7, 8], but also to planning problems and fundamental artificial intelligence tasks [9, 10]. It clearly outperforms alpha-beta techniques when no human expertise can easily be encoded into a value function. In this section, we describe MCTS and how it enabled great improvements in computer Go. Section II presents the strengths and limitations of MCTS, in particular its lack of learning. A few techniques for introducing learning are already known: Rapid Action Value Estimation (RAVE) and learnt patterns (both now well known, and discussed below); our focus is on more recent and less widely known learning techniques introduced in MCTS. The next two sections present these less standard applications of supervised learning within MCTS: Section III shows how to use past games to improve future games, and Section IV shows how learning can be included inside a given MCTS run. Section V concludes.
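To make the discussion concrete, the following is a minimal UCT-style MCTS sketch in Python, applied to a toy counting game (reach exactly `GOAL` by adding 1 or 2). The toy game, the constants `GOAL` and `ACTIONS`, and all function names are illustrative assumptions for this sketch, not taken from the paper; they only serve to show the four standard phases of selection, expansion, simulation, and backpropagation.

```python
import math
import random

class Node:
    """One node of the search tree: a state plus visit statistics."""
    def __init__(self, state, parent=None):
        self.state = state
        self.parent = parent
        self.children = {}   # action -> child Node
        self.visits = 0
        self.value = 0.0     # sum of rollout rewards backed up through this node

# Hypothetical toy problem: from some integer state, add 1 or 2 per move;
# landing exactly on GOAL wins (reward 1), overshooting loses (reward 0).
ACTIONS = (1, 2)
GOAL = 5

def is_terminal(state):
    return state >= GOAL

def reward(state):
    return 1.0 if state == GOAL else 0.0

def uct_select(node, c=1.4):
    """Selection: pick the child maximizing the UCB1 score."""
    return max(node.children.values(),
               key=lambda ch: ch.value / ch.visits
                              + c * math.sqrt(math.log(node.visits) / ch.visits))

def rollout(state):
    """Simulation: play random moves until the game ends."""
    while not is_terminal(state):
        state += random.choice(ACTIONS)
    return reward(state)

def mcts(root_state, n_iters=500):
    root = Node(root_state)
    for _ in range(n_iters):
        node = root
        # 1) Selection: descend through fully expanded, non-terminal nodes.
        while (node.children and len(node.children) == len(ACTIONS)
               and not is_terminal(node.state)):
            node = uct_select(node)
        # 2) Expansion: add one untried child, if the node is not terminal.
        if not is_terminal(node.state):
            action = random.choice([a for a in ACTIONS if a not in node.children])
            child = Node(node.state + action, parent=node)
            node.children[action] = child
            node = child
        # 3) Simulation: random playout from the new node.
        result = rollout(node.state)
        # 4) Backpropagation: update statistics up to the root.
        while node is not None:
            node.visits += 1
            node.value += result
            node = node.parent
    # Recommend the most-visited action at the root.
    return max(root.children.items(), key=lambda kv: kv[1].visits)[0]
```

For instance, from state 4 the move +1 reaches `GOAL` exactly while +2 overshoots, so `mcts(4)` should return `1` after concentrating its visits on the winning child, illustrating how MCTS needs no hand-crafted value function, only the ability to play random games to the end.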