首页> 外文会议>IEEE Symposium on Computational Intelligence and Games >Coevolutionary Temporal Difference Learning for Othello
【24h】

Coevolutionary Temporal Difference Learning for Othello

机译:奥赛罗的共同态度差异学习

获取原文

摘要

This paper presents Coevolutionary Temporal Difference Learning (CTDL), a novel way of hybridizing coevolutionary search with reinforcement learning that works by interlacing one-population competitive coevolution with temporal difference learning. The coevolutionary part of the algorithm provides for exploration of the solution space, while the temporal difference learning performs its exploitation by local search. We apply CTDL to the board game of Othello, using weighted piece counter for representing players' strategies. The results of an extensive computational experiment demonstrate CTDL's superiority when compared to coevolution and reinforcement learning alone, particularly when coevolution maintains an archive to provide historical progress. The paper investigates the role of the relative intensity of coevolutionary search and temporal difference search, which turns out to be an essential parameter. The formulation of CTDL leads also to the introduction of Lamarckian form of coevolution, which we discuss in detail.
机译:本文介绍了共同差分差异学习(CTDL),一种新颖的一种杂交共同学习的共同学习方式,其通过交错与时间差异学习的一群竞争协调。算法的共施加部分提供了解决方案的探索,而时间差异学习通过本地搜索执行其利用。我们将CTDL应用于Othello的棋盘游戏,使用加权柜台代表玩家的策略。与单独的协调和加强学习相比,广泛计算实验的结果表明CTDL的优势,特别是当共参加档案以提供历史进步时。本文调查了共同调查搜索和时间差异搜索的相对强度的作用,结果是一个重要参数。 CTDL的制剂也引入了引入Lamarckian的参数形式,我们详细讨论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号