Beating the Defense: Using Plan Recognition to Inform Learning Agents

Abstract

In this paper, we investigate the hypothesis that plan recognition can significantly improve the performance of a case-based reinforcement learner in an adversarial action selection task. Our environment is a simplification of an American football game. The performance task is to control the behavior of a quarterback in a pass play, where the goal is to maximize yardage gained. Plan recognition focuses on predicting the play of the defensive team. We modeled plan recognition as an unsupervised learning task, and conducted a lesion study. We found that plan recognition was accurate, and that it significantly improved performance. More generally, our studies show that plan recognition reduced the dimensionality of the state space, which allowed learning to be conducted more effectively. We describe the algorithms, explain the reasons for performance improvement, and also describe a further empirical comparison that highlights the utility of plan recognition for this task.
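
The claim that plan recognition reduces the dimensionality of the state space can be pictured with a small sketch. This is an illustration only, not the authors' algorithm: it assumes k-means clustering as the unsupervised learner, the observation values are invented, and the names `kmeans`, `nearest`, and `recognize_play` are hypothetical. The idea shown is that a raw observation of the defense is replaced by a single discrete play label, which a learner could then use as a compact state feature.

```python
import math

def nearest(point, centroids):
    """Index of the centroid closest to point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))

def kmeans(points, k, iterations=20):
    """Plain k-means. Initial centroids are the first k points, an assumption
    made here for determinism, not a recommended initialisation."""
    centroids = [tuple(p) for p in points[:k]]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # Recompute each centroid as the mean of its members; keep the old
        # centroid if a cluster ended up empty.
        centroids = [
            tuple(sum(c) / len(members) for c in zip(*members)) if members else centroids[i]
            for i, members in enumerate(clusters)
        ]
    return centroids

# Hypothetical observations: each row stands for early-play displacements of
# the defense (values invented for illustration).
observations = [
    (0.1, 0.2, 0.0, 0.1),   # defenders hold position
    (5.0, 5.1, 4.9, 5.2),   # defenders drop back
    (0.0, 0.1, 0.2, 0.0),
    (4.8, 5.0, 5.1, 4.9),
]

centroids = kmeans(observations, k=2)

def recognize_play(observation):
    """Map a raw observation to a discrete play label (cluster index)."""
    return nearest(observation, centroids)

label_a = recognize_play((0.05, 0.1, 0.1, 0.05))
label_b = recognize_play((5.1, 4.9, 5.0, 5.0))
```

In this toy setting a four-value observation collapses to one label, so a downstream learner would need to distinguish only as many defensive situations as there are clusters.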

Bibliographic details

Source
International Florida Artificial Intelligence Research Society Conference | 2009 | 6 pages
Authors
Matthew Molineaux; David W. Aha; Gita Sukthankar
Original format: PDF
Chinese Library Classification: TP18-53

Similar literature

1. Learning Plan Schemata from Observation: Explanation-Based Learning for Plan Recognition [J]. Raymond J. Mooney. Cognitive Science, 1990, no. 4.
2. RL-VAEGAN: Adversarial defense for reinforcement learning agents via style transfer [J]. Hu Yueyue, Sun Shiliang. Knowledge-Based Systems, 2021, no. Juna7.
3. Distributed Learning for Planning Under Uncertainty Problems with Heterogeneous Teams: Scaling Up the Multiagent Planning with Distributed Learning and Approximate Representations [J]. N. Kemal Ure, Girish Chowdhary, Yu Fan Chen. Journal of Intelligent & Robotic Systems: Theory & Application, 2014, no. 1a2.
4. Beating the Defense: Using Plan Recognition to Inform Learning Agents [C]. Matthew Molineaux, David W. Aha, Gita Sukthankar. Proceedings of the Twenty-Second International Florida Artificial Intelligence Research Society Conference, 2009.
5. Plan-based plan recognition models for the effective coordination of agents through observation [D]. Huber, Marcus James. 1996.
6. An Embodied Agent Learning Affordances With Intrinsic Motivations and Solving Extrinsic Tasks With Attention and One-Step Planning [O]. Gianluca Baldassarre, William Lord, Giovanni Granato. 2019.
7. Probabilistic plan recognition for intelligent information agents: Towards proactive software assistant agents [O]. Jean Oh, Felipe Meneguzzi, Katia Sycara. 2011.
8. Beating the Defense: Using Plan Recognition to Inform Learning Agents [R]. Molineaux, M., Aha, D. W., Sukthankar, G. 2009.
