首页> 外文会议>AIAA SciTech Forum and Exposition >Utilizing Reinforcement Learning to Continuously Improve a Primitive-Based Motion Planner
【24h】

Utilizing Reinforcement Learning to Continuously Improve a Primitive-Based Motion Planner

机译:利用钢筋学习,不断改进基于原始的运动计划

获取原文

摘要

This paper describes how the performance of motion primitive based planning algorithms can be improved using reinforcement learning. Specifically, we describe and evaluate a framework for policy improvement via the discovery of new motion primitives. Our approach combines the predictable behavior of deterministic planning methods with the exploration capability of reinforcement learning. The framework consists of three phases: evaluation, exploration, and extraction. This framework can be iterated continuously to provide successive improvement. The evaluation step scores the performance of a motion primitive library using value iteration to create a cost map. A local difference metric is then used to identify regions that need improvement. The exploration step utilizes reinforcement learning to examine new trajectories in the regions of greatest need. The extraction step encodes the agent's experiences into new primitives. The framework is tested on a point-to-point navigation task using a 6-DOF nonlinear F-16 model. One iteration of the framework discovered 17 new primitives and provided a maximum planning time reduction of 96.91 %. After 3 full iterations, 123 primitives were added with a maximum time reduction of 97.39%. The proposed framework is easily extensible to a range of vehicles, environments, and cost functions.
机译:本文介绍了如何利用增强学习改进运动原基本规划算法的性能。具体而言,我们通过发现新的运动原语来描述和评估政策改进的框架。我们的方法结合了确定性规划方法的可预测行为,以勘探学习的勘探能力。该框架由三个阶段组成:评估,探索和提取。此框架可以连续迭代以提供连续的改进。评估步骤使用价值迭代来创建成本映射的运动原始库的性能。然后使用局部差分度量来识别需要改进的区域。勘探步骤利用增强学习来检查最需要的地区的新轨迹。提取步骤将代理商的经验编码为新的基元。使用6-DOF非线性F-16型号在点对点导航任务上测试框架。框架的一次迭代发现了17个新的基元,并提供了最大规划时间减少96.91%。 3次完全迭代后,加入123个基元,最大时间减少97.39%。该框架易于扩展到一系列车辆,环境和成本函数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号