Self-Play and Using an Expert to Learn to Play Backgammon with Temporal Difference Learning

Marco A. Wiering


Abstract

A promising approach to learning to play board games is to use reinforcement learning algorithms that learn a game position evaluation function. In this paper we examine and compare three methods for generating training games: (1) learning by self-play, (2) learning by playing against an expert program, and (3) learning from viewing experts play against each other. Although the third method generates high-quality games from the start, whereas self-play initially generates random games, its drawback is that the learning program is never allowed to test the moves it prefers. Since our expert program uses an evaluation function similar to that of the learning program, we also examine whether it helps to learn directly from the board evaluations given by the expert. We compare these methods using temporal difference methods with neural networks to learn the game of backgammon.
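As a rough illustration of the temporal difference idea mentioned in the abstract, the sketch below applies one episode of TD(λ) with accumulating eligibility traces to the sequence of positions from a single finished game. It is not the paper's implementation: the linear evaluator stands in for the neural network, and the function name `td_lambda_episode`, the feature vectors, and the parameters `alpha` and `lam` are all hypothetical. Under these assumptions, the three training setups would differ only in who chose the moves that produced `positions` (the learner itself, the learner against an expert, or two experts).

```python
from typing import List, Sequence


def td_lambda_episode(
    weights: Sequence[float],
    positions: Sequence[Sequence[float]],
    outcome: float,
    alpha: float = 0.1,
    lam: float = 0.7,
) -> List[float]:
    """Return updated weights after one game (undiscounted, reward only at the end).

    Assumption: the evaluation function is linear, V(x) = w . x, so the
    gradient of V with respect to the weights is the feature vector x itself.
    """
    w = list(weights)
    trace = [0.0] * len(w)  # eligibility trace, one entry per weight
    for t, x in enumerate(positions):
        v_t = sum(wi * xi for wi, xi in zip(w, x))
        if t + 1 < len(positions):
            # Bootstrap from the evaluation of the next position.
            v_next = sum(wi * xi for wi, xi in zip(w, positions[t + 1]))
        else:
            # The last position is scored by the actual game result.
            v_next = outcome
        delta = v_next - v_t  # temporal difference error
        trace = [lam * e + xi for e, xi in zip(trace, x)]
        w = [wi + alpha * delta * e for wi, e in zip(w, trace)]
    return w


# Hypothetical two-position game that ends in a win (outcome 1.0).
game = [[1.0, 0.0], [0.0, 1.0]]
new_weights = td_lambda_episode([0.0, 0.0], game, outcome=1.0, alpha=0.5, lam=1.0)
```

With `lam=0.0` only the final position's features receive credit for the result, while `lam=1.0` spreads the credit back over every earlier position in the game.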

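Bibliographic details

Source: Intelligent Learning Systems and Applications (English edition), 2010, No. 2, pp. 57-68 (12 pages)
Author: Marco A. Wiering
Author affiliation: not specified
Original format: PDF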
Language of text: English
Chinese Library Classification: Oncology
Keywords: Board Games; Reinforcement Learning; TD(λ); Self-Play; Learning from Demonstration
