首页> 外文会议>IEEE International Conference on Tools with Artificial Intelligence >MP-Draughts: Ordering the Search Tree and Refining the Game Board Representation to Improve a Multi-agent System for Draughts
【24h】

MP-Draughts: Ordering the Search Tree and Refining the Game Board Representation to Improve a Multi-agent System for Draughts

机译:MP-Draughts:订购搜索树并完善游戏板表示,以改进草稿的多代理系统

获取原文

摘要

In this paper the authors present an extension of the automatic player system MP-Draughts (MultiPhase-Draughts): a self-learning multi-agent environment for draughts composed of 26 Multiple Layer Perceptrons (MLPs). The weights of the MLPs are updated by Temporal Differences (TD). The search for the best move is conducted by a search algorithm based on Alpha-Beta pruning, Iterative Deepening and Table Transposition. One of the agents is trained in such a way that it becomes an expert in the initial stages of play and the remaining (25), in endgame stages. The endgame boards used to train the endgame agents are retrieved from an endgame board database and clustered by a Kohonem-SOM Neural Network (NN). The same Kohonem-SOM NN will also be used during the games to select which endgame agent is more suitable to play each time the endgame stage of play is reached. In this paper the authors propose the following modifications to improve the performance of MP-Draughts: first, to change the mapping of the board states such that, instead of indicating the presence or not of certain features, it indicates the number of elements pointed out by each feature, second, to order the search tree of each agent in such a way as to attenuate the innumerous re-evaluations of the same board state inherent to the iterative deepening strategy.
机译:在本文中,作者提出了自动播放器系统MP-Draughts(MultiPhase-Draughts)的扩展:一种由26个多层感知器(MLP)组成的草稿的自学习多主体环境。 MLP的权重通过时间差异(TD)进行更新。通过基于Alpha-Beta修剪,迭代加深和表转置的搜索算法进行最佳移动的搜索。其中一名特工经过培训,使其在比赛的初始阶段成为专家,在剩余比赛阶段成为剩余的(25)专家。从终局棋盘数据库中检索用于训练终局代理人的终局棋盘,并通过Kohonem-SOM神经网络(NN)进行聚类。在游戏期间,还将使用相同的Kohonem-SOM NN,以选择每次到达游戏结束阶段时更适合玩哪个游戏结束代理。在本文中,作者建议进行以下修改以提高MP-Draughts的性能:首先,更改板状态的映射,以便指示所指出的元素数量,而不是指示是否存在某些功能。通过每个功能,第二,以某种方式对每个代理的搜索树进行排序,以减弱迭代加深策略固有的对同一板状态的无数重新评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号