...
首页> 外文期刊>Neurocomputing >A hierarchical path planning approach based on A* and least-squares policy iteration for mobile robots
【24h】

A hierarchical path planning approach based on A* and least-squares policy iteration for mobile robots

机译:基于A *和最小二乘策略迭代的移动机器人分层路径规划方法

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose a novel hierarchical path planning approach for mobile robot navigation in complex environments. The proposed approach has a two-level structure. In the first level, the A* algorithm based on grids is used to find a geometric path quickly and several path points are selected as subgoals for the next level. In the second level, an approximate policy iteration algorithm called least-squares policy iteration (LSPI) is used to learn a near-optimal local planning policy that can generate smooth trajectories under kinematic constraints of the robot. Using this near-optimal local planning policy, the mobile robot can find an optimized path by sequentially approaching the subgoals obtained in the first level. One advantage of the proposed approach is that the kinematic characteristics of the mobile robot can be incorporated into the LSPI-based path optimization procedure. The second advantage is that the LSPI-based local path optimizer uses an approximate policy iteration algorithm which has been proven to be data-efficient and stable. The training of the local path optimizer can use sample experiences collected randomly from any reasonable sampling distribution. Furthermore, the LSPI-based local path optimizer has the ability of dealing with uncertainties in the environment. For unknown obstacles, it just needs to replan the path in the second level rather than the whole planner. Simulations for path planning in various types of environments have been carried out and the results demonstrate the effectiveness of the proposed approach. (C) 2015 Elsevier B.V. All rights reserved.
机译:在本文中,我们为复杂环境中的移动机器人导航提出了一种新颖的分层路径规划方法。所提出的方法具有两级结构。在第一级中,基于网格的A *算法用于快速查找几何路径,并选择多个路径点作为下一级的子目标。在第二级中,一种称为最小二乘策略迭代(LSPI)的近似策略迭代算法用于学习一种接近最佳的局部计划策略,该策略可以在机器人的运动学约束下生成平滑的轨迹。使用这种接近最佳的本地计划策略,移动机器人可以通过顺序接近在第一级中获得的子目标来找到优化路径。所提出的方法的一个优点是可以将移动机器人的运动学特性纳入基于LSPI的路径优化过程中。第二个优点是基于LSPI的本地路径优化器使用一种近似策略迭代算法,该算法已被证明是数据高效且稳定的。本地路径优化器的培训可以使用从任何合理的采样分布中随机收集的采样经验。此外,基于LSPI的本地路径优化器具有处理环境中不确定性的能力。对于未知的障碍,只需要在第二级而不是整个计划者中重新规划路径。已经对各种类型的环境中的路径规划进行了仿真,结果证明了该方法的有效性。 (C)2015 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号