首页> 外文会议>Australasian Joint Conference on Artificial Intelligence >Exploring a Learning Architecture for General Game Playing
【24h】

Exploring a Learning Architecture for General Game Playing

机译:探索普通游戏播放的学习架构

获取原文
获取外文期刊封面目录资料

摘要

General Game Playing (GGP) is a platform for developing general Artificial Intelligence algorithms to play a large variety of games that are unknown to players in advance. This paper describes and analyses GGPZero, a learning architecture for GGP, inspired by the success of AlphaGo and AlphaZero. GGPZero takes as input a previously unknown game description and constructs a deep neural network to be trained using self-play together with Monte-Carlo Tree Search. The general architecture of GGPZero is similar to that of Goldwaser and Thielscher (2020) with the main differences in the choice of the GGP reasoner and the neural network construction; furthermore, we explore additional experimental evaluation strategies. Our main contributions are: confirming the feasibility of deep reinforcement for GGP, analysing the impact of the type and depth of the underlying neural network, and investigating simulation vs. time limitations on training.
机译:普通游戏播放(GGP)是开发一般人工智能算法的平台,以便提前玩过的球员未知的各种游戏。 本文介绍和分析了GGPP,这是GGP的学习架构,受到Alphago和Alphazero的成功的启发。 GGPzero将作为输入的输入一个先前未知的游戏描述,并构建一个深度神经网络,用于使用自助游戏与Monte-Carlo树搜索进行培训。 GGPzero的总体体系结构类似于Goldwaser和Thielscher(2020)的架构,主要差异是GGP推理和神经网络建设的主要差异; 此外,我们探讨了其他实验评估策略。 我们的主要贡献是:确认GGP的深度加强的可行性,分析了潜在神经网络的类型和深度的影响,以及调查模拟与训练时间限制。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号