首页> 外文会议>European conference on applications of evolutionary computation >Multi-Criteria Comparison of Coevolution and Temporal Difference Learning on Othello
【24h】

Multi-Criteria Comparison of Coevolution and Temporal Difference Learning on Othello

机译:Othello上的协同进化和时差学习的多标准比较

获取原文

摘要

We compare Temporal Difference Learning (TDL) with Coevolutionary Learning (CEL) on Othello. Apart from using three popular single-criteria performance measures: (ⅰ) generalization performance or expected utility, (ⅱ) average results against a hand-crafted heuristic and (ⅲ) result in a head to head match, we compare the algorithms using performance profiles. This multi-criteria performance measure characterizes player's performance in the context of opponents of various strength. The multi-criteria analysis reveals that although the generalization performance of players produced by the two algorithms is similar, TDL is much better at playing against strong opponents, while CEL copes better against weak ones. We also find out that the TDL produces less diverse strategies than CEL. Our results confirms the usefulness of performance profiles as a tool for comparison of learning algorithms for games.
机译:我们将时间差异学习(TDL)与Othello上的协同进化学习(CEL)进行了比较。除了使用三种流行的单一标准性能指标:(ⅰ)泛化性能或预期效用,(ⅱ)相对于手工启发式算法的平均结果和(ⅲ)正面对头的匹配之外,我们还使用性能概况来比较算法。这项多标准的表现指标可以表征玩家在各种实力的对手中的表现。多准则分析表明,尽管两种算法产生的选手的综合表现相似,但TDL在对抗强势对手方面要好得多,而CEL在对抗弱势对手方面要更好。我们还发现,与CEL相比,TDL产生的多样性策略更少。我们的结果证实了性能概况作为比较游戏学习算法的工具的有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号