首页> 外文会议>2017 Intelligent Systems Conference >Exploiting action categories in learning complex games

【24h】

Exploiting action categories in learning complex games

机译：在学习复杂游戏中利用动作类别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a model for planning in a highly complex game, where certain action types are more common than others and cyclic behaviour can also easily arise. These issues are addressed by exploiting the inherent structure among the possible options to enhance the online learning algorithm: sampling during Monte Carlo Tree Search becomes a two step process, by first sampling from a distribution over the types of legal actions followed by sampling from individual actions of the chosen type. This policy drastically reduces the breadth of the rollout as well as its depth by avoiding redundant sampling behaviour. The result is a large increase in both the performance and efficiency of the model. Another contribution of this paper is assessing the benefits of a parallel implementation and afterstates in complex games. Evaluation is done via agent simulations in the board game Settlers of Catan. The resulting agent is the first based on purely online learning strategies that can handle the full set of legal actions of the game. The evaluation shows that our model outperforms previous state-of-the-art agents while taking decisions in a time threshold tolerated by human opponents.

机译：本文提出了一种在高度复杂的游戏中进行规划的模型，其中某些动作类型比其他动作类型更为常见，并且循环行为也很容易出现。通过利用可能的选项中的固有结构来增强在线学习算法，可以解决这些问题：蒙特卡洛树搜索期间的采样成为一个两步过程，首先从合法行为类型的分布中采样，然后从单个行为中采样所选类型的。通过避免重复的采样行为，此策略极大地降低了部署的宽度和深度。结果大大提高了模型的性能和效率。本文的另一项贡献是评估了复杂游戏中并行实现和后状态的好处。通过棋盘游戏《卡坦的定居者》中的特工模拟来进行评估。最终的代理商是第一个基于纯粹在线学习策略的代理商，该策略可以处理游戏的所有法律诉讼。评估显示，我们的模型在人类对手可以忍受的时间阈值内做出决策的同时，胜过了以往的最新代理。

著录项

来源
《2017 Intelligent Systems Conference 》|2017年|729-737|共9页
会议地点 London(GB)
作者
Mihai S. Dobre; Alex Lascarides;
展开▼
作者单位

School of Informatics, University of Edinburgh, Edinburgh, EH8 9AB, Scotland;

School of Informatics, University of Edinburgh, Edinburgh, EH8 9AB, Scotland;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Games; Planning; Law; Monte Carlo methods; Learning (artificial intelligence); Complexity theory;

机译：游戏;规划;法律;蒙特卡洛方法;学习（人工智能）;复杂性理论;;

相似文献

外文文献
中文文献
专利

1. Learning English With The Sims: Exploiting Authentic Computer Simulation Games For L2 Learning [J] . Jim Ranalli Computer assisted language learning . 2008 ,第5期

机译：与模拟市民一起学习英语：利用真正的计算机模拟游戏进行L2学习
2. Learning through playing Virtual Age: Exploring the interactions among student concept learning, gaming performance, in-game behaviors, and the use of in-game characters [J] . Cheng Meng-Tzu, Lin Yu-Wen, She Hsiao-Ching Computers & education . 2015 ,第AUGa期

机译：通过玩虚拟时代来学习：探索学生概念学习，游戏性能，游戏中行为以及游戏中角色使用之间的相互作用
3. Exploiting abstractions for grammar-based learning of complex multi-agent behaviours [J] . Dilini Samarasinghe, Michael Barlow, Erandi Lakshika, International Journal of Intelligent Systems . 2021 ,第11期

机译：利用基于语法的复杂多智能经纪行为学习的抽象
4. Exploiting action categories in learning complex games [C] . Mihai S. Dobre, Alex Lascarides Intelligent Systems Conference . 2017

机译：在学习复杂游戏中开发行动类别
5. Game Learning Analytics and QualitativeMethods for Actionable Change in a Curriculum-Integrated EducationalMath Game [D] . Peddycord-Liu, Zhongxiu Aurora. 2018

机译：游戏学习分析和QualitativeMethod在课程综合教育专业课程中的可行变化
6. Learning exploitation and bias in games [O] . John M. McNamara, Alasdair I. Houston, Olof Leimar 2021

机译：游戏中的学习剥削和偏见
7. Exploiting action categories in learning complex games [O] . Mihai S. Dobre, Alex Lascarides 2017

机译：在学习复杂游戏中开发行动类别

Exploiting action categories in learning complex games

摘要

著录项

相似文献

相关主题

期刊订阅