Investigations into Playing Chess Endgames using Reinforcement Learning.

Abstract

Research in computer game playing has relied primarily on brute-force search rather than on any formal AI method. However, these methods may not be able to exceed human ability, because they depend on human expert knowledge to perform as well as they do. Reinforcement learning, a recently popularized field of research, has shown good prospects for overcoming these limitations when applied to non-deterministic games.

This thesis investigated whether the TD(λ) algorithm, one method of reinforcement learning, using standard back-propagation neural networks for function generalization, could successfully learn a deterministic game such as chess. The aim is to determine whether an agent using no external knowledge can learn to defeat a random player consistently.

The results of this thesis suggest that, even though the agents faced a highly information-sparse environment, an agent using a well-selected view of the state information was still able to learn not only to differentiate between various terminating board positions but also to improve its play against a random player. This shows that reinforcement learning techniques are quite capable of learning behaviour in large deterministic environments without needing any external knowledge.
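For readers unfamiliar with TD(λ), the following is a minimal illustrative sketch of the update rule on a toy problem. It is not the thesis's implementation: the thesis used back-propagation neural networks on chess endgames, whereas this sketch assumes a tabular value function on a hypothetical five-state random walk (the walk ends with reward 0 on the left and reward 1 on the right). The function name, parameters, and default values are all assumptions chosen for illustration.

```python
import random


def td_lambda_random_walk(episodes=5000, alpha=0.02, lam=0.8, gamma=1.0, seed=0):
    """Tabular TD(lambda) value estimation on a 5-state random walk (toy example)."""
    rng = random.Random(seed)
    n_states = 5
    values = [0.5] * n_states  # initial value guess for every non-terminal state

    for _ in range(episodes):
        traces = [0.0] * n_states  # eligibility traces, reset at each episode start
        state = n_states // 2      # every episode starts in the middle state

        while True:
            next_state = state + rng.choice((-1, 1))

            if next_state < 0:            # fell off the left end: reward 0
                reward, next_value, done = 0.0, 0.0, True
            elif next_state >= n_states:  # fell off the right end: reward 1
                reward, next_value, done = 1.0, 0.0, True
            else:
                reward, next_value, done = 0.0, values[next_state], False

            # Temporal-difference error for this transition.
            delta = reward + gamma * next_value - values[state]

            # Decay all traces, then bump the trace of the state just visited
            # (accumulating traces).
            traces = [gamma * lam * e for e in traces]
            traces[state] += 1.0

            # Every state is updated in proportion to its eligibility.
            for s in range(n_states):
                values[s] += alpha * delta * traces[s]

            if done:
                break
            state = next_state

    return values


if __name__ == "__main__":
    # The true values for this walk are 1/6, 2/6, ..., 5/6.
    print([round(v, 2) for v in td_lambda_random_walk()])
```

The eligibility traces let a single terminal reward adjust the estimates of all recently visited states, which matters in sparse-reward settings such as the one the abstract describes. With a neural network in place of the table, the traces would be kept per weight rather than per state.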

Bibliographic details

Author: Dazeley R
Year: 2001
Original format: PDF
Language: English
