【24h】

Multiagent Reactive Plan Application Learning in Dynamic Environments

机译:动态环境中的多主体反应式计划应用学习

获取原文
获取原文并翻译 | 示例

摘要

In addition to bottom-up learning approaches, which facilitate emergent policy learning, it also is desirable to have top-down control over learning so that a team of agents can also learn to apply general policies to diverse dynamic situations. We present a. multiagent case-based learning methodology to achieve this top-down control. In this methodology, high-level symbolic plans describe policies a team of agents needs to learn to apply to different situations. For each plan whose preconditions match their current team state, agents learn to operationalize that plan. In each training scenario, each agent learns a sequence of actions that implements each step in the given plan such that the entire plan is opera-tionalized under current external conditions. This application knowledge is acquired via searching through a small set of available high-level actions and testing the success of each sequence of actions in the situated environment. Similarity between a new situation and existing cases is measured by considering only the state internal to the team, and an agent stores the successful sequence of actions in the current plan step indexed under the current external state. By repeating this process for each plan step using many diverse training scenarios, a team of agents learns how to operationalize an entire plan in a wide variety of external situations, hence achieving generality. We demonstrate our approach using the RoboCup soccer simulator.
机译:除了自底向上的学习方法(它有助于紧急策略学习)之外,还希望对学习进行自上而下的控制,以便代理团队也可以学习将通用策略应用于各种动态情况。我们提出一个。基于案例的多主体学习方法可实现这种自上而下的控制。在这种方法中,高级符号计划描述了座席团队需要学习以应用于不同情况的策略。对于每个前提条件与其当前团队状态相匹配的计划,座席将学习如何实施该计划。在每种培训方案中,每个代理都学习一系列操作,这些操作将执行给定计划中的每个步骤,从而使整个计划在当前外部条件下可操作。通过搜索一小组可用的高级操作并在所处环境中测试每个操作序列的成功性来获取此应用程序知识。通过仅考虑团队内部的状态来度量新情况与现有案例之间的相似性,并且代理将成功的操作顺序存储在当前外部状态下编制索引的当前计划步骤中。通过使用多种多样的培训方案在每个计划步骤中重复此过程,特工团队将学习如何在各种外部情况下实施整个计划,从而实现通用性。我们使用RoboCup足球模拟器演示了我们的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号