Argumentation-Based Reinforcement Learning for RoboCup Soccer Keepaway

机译：基于参数的强化学习，用于RoboCup足球比赛

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reinforcement Learning (RL) suffers from several difficulties when applied to domains with no obvious goal state defined; this leads to inefficiency in RL algorithms. In this paper we consider a solution within the context of a widely-used testbed for RL, that of RoboCup Keepaway soccer. We introduce Argumentation-Based RL (ABRL), using methods from argumentation theory to integrate domain knowledge, represented by arguments, into the SMDP algorithm for RL by using potential-based reward shaping. Empirical results show that ABRL outperforms the original SMDP algorithm, for this game, by improving the optimal performance.

机译：强化学习（RL）应用于没有明确目标状态定义的领域时，会遇到许多困难。这导致RL算法效率低下。在本文中，我们在RL广泛使用的测试平台RoboCup Keepaway足球的测试环境中考虑一种解决方案。我们引入基于议论的RL（ABRL），它使用议论理论的方法，通过使用基于势能的奖励整形，将以论点表示的领域知识集成到RL的SMDP算法中。实验结果表明，对于该游戏，ABRL通过提高最佳性能而优于原始SMDP算法。

著录项

来源
《20th European conference on artificial intelligence》|2012年|342-347|共6页
会议地点 Montpellier(FR)
作者
Yang Gao; Francesca Toni; Robert Craven;
展开▼
作者单位

Imperial College London, UK;

Imperial College London, UK;

Imperial College London, UK;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Reinforcement Learning for RoboCup Soccer Keepaway [J] . Peter Stone, Richard S. Sutton, Gregory Kuhlmann Adaptive Behavior . 2005,第3期

机译：RoboCup足球禁区的强化学习
2. マルチエージェント連続タスクにおける報酬設計の実験的考察－RoboCup Soccer Keepaway タスクを例として [J] . 荒井幸代, 田中信行, Sachiyo Arai, 人工知能学会論文誌 . 2006,第6期

机译：多Agent连续任务中奖励设计的实验考虑-以RoboCup足球禁忌任务为例
3. マルチエージェント連続タスクにおける報酬設計の実験的考察－RoboCup Soccer Keepaway タスクを例として [J] . 荒井幸代, 田中信行, Sachiyo Arai, 人工知能学会論文誌 . 2006,第6期

机译：多售后持续任务 - Robocup足球昆虫淘场任务的补偿设计实验研究作为示例
4. Argumentation-Based Reinforcement Learning for RoboCup Soccer Keepaway [C] . Yang Gao, Francesca Toni, Robert Craven European Conference on Artificial Intelligence . 2012

机译：基于论点的强化钢筋儿童守门员
5. A scene learning and recognition framework for RoboCup clients. [D] . Lam, Kevin. 2005

机译：针对RoboCup客户的场景学习和识别框架。
6. Structure-Preserving Imitation Learning With Delayed Reward: An Evaluation Within the RoboCup Soccer 2D Simulation Environment [O] . Quang Dang Nguyen, Mikhail Prokopenko 2020

机译：延迟奖励的结构保留模仿学习：Robocup Soccer 2D模拟环境中的评估
7. Two steps reinforcement learning en robocup-soccer keepaway [O] . López-Bueno Hernández Iván 2009

机译：两步强化学习en robocup-soccer keepaway

Argumentation-Based Reinforcement Learning for RoboCup Soccer Keepaway

摘要

著录项

相似文献

相关主题

期刊订阅