首页> 外文会议>International Florida Artificial Intelligence Research Society Conference >Beating the Defense: Using Plan Recognition to Inform Learning Agents
【24h】

Beating the Defense: Using Plan Recognition to Inform Learning Agents

机译:殴打国防:利用计划承认通知学习代理人

获取原文

摘要

In this paper, we investigate the hypothesis that plan recognition can significantly improve the performance of a case-based reinforcement learner in an adversarial action selection task. Our environment is a simplification of an American football game. The performance task is to control the behavior of a quarterback in a pass play, where the goal is to maximize yardage gained. Plan recognition focuses on predicting the play of the defensive team. We modeled plan recognition as an unsupervised learning task, and conducted a lesion study. We found that plan recognition was accurate, and that it significantly improved performance. More generally, our studies show that plan recognition reduced the dimensionality of the state space, which allowed learning to be conducted more effectively. We describe the algorithms, explain the reasons for performance improvement, and also describe a further empirical comparison that highlights the utility of plan recognition for this task.
机译:在本文中,我们调查了计划识别可以在对抗诉讼行动选择任务中显着提高基于案例的加强学习者的表现的假设。我们的环境是一项简化美国足球比赛的简化。性能任务是控制通过游戏中四分卫的行为,目标是最大化递码。计划识别侧重于预测防守团队的戏剧。我们将计划识别建模为无监督的学习任务,并进行了病变研究。我们发现计划识别是准确的,并且它显着提高了性能。更一般地,我们的研究表明,计划识别减少了国家空间的维度,这使得允许学习更有效地进行。我们描述了算法,解释了绩效改进的原因,还描述了一个进一步的实证比较,突出了计划识别为此任务的实用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号