Venue: IEEE International Conference on Robotics and Automation

Acceleration of Gradient-based Path Integral Method for Efficient Optimal and Inverse Optimal Control

Abstract

This paper presents an accelerated path integral method that iteratively searches for optimal controls in a small number of iterations. The study builds on the recent observation that a path integral method for reinforcement learning can be interpreted as gradient descent. The same observation applies to an iterative path integral method for optimal control, which makes a convincing case for applying optimization techniques developed for gradient descent, such as momentum-based acceleration, step-size adaptation, and their combination. We introduce these techniques into the path integral method and demonstrate that momentum-based methods, such as Nesterov Accelerated Gradient and Adam, can significantly improve the convergence rate of the search for optimal controls in simulated control systems. We also demonstrate that the accelerated path integral method could improve the performance of model predictive control on various vehicle navigation tasks. Finally, we represent the accelerated path integral method as a recurrent network, an accelerated version of the previously proposed path integral networks (PI-Net). The accelerated PI-Net can be trained for inverse optimal control more efficiently, and with less RAM, than the original PI-Net.
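To make the gradient-descent reading of the abstract concrete, here is a minimal sketch of an iterative path integral update with an optional Nesterov-style momentum term. It is not the paper's implementation: the toy single-integrator dynamics, the quadratic cost, the function names (`rollout_cost`, `pi_step`, `optimize`), and all hyperparameters are assumptions chosen for illustration. The softmax-weighted average of sampled perturbations is treated as the (negative) gradient step, and momentum is applied to it in the usual look-ahead form.

```python
import numpy as np

def rollout_cost(u, dt=0.1, target=1.0):
    """Cost of control sequences u (shape (..., T)) for the assumed toy
    dynamics x_{t+1} = x_t + dt * u_t with x_0 = 0."""
    x = dt * np.cumsum(u, axis=-1)  # states x_1 .. x_T
    return np.sum((x - target) ** 2, axis=-1) + 0.01 * np.sum(u ** 2, axis=-1)

def pi_step(u, rng, n_samples=256, sigma=1.0, lam=1.0):
    """One path integral update direction: a softmax-weighted average of
    sampled control perturbations (low-cost samples get high weight)."""
    eps = sigma * rng.standard_normal((n_samples, u.size))
    costs = rollout_cost(u + eps)
    w = np.exp(-(costs - costs.min()) / lam)  # subtract min for stability
    w /= w.sum()
    return w @ eps

def optimize(n_iters=30, horizon=20, momentum=0.0, seed=0):
    """Iterative path integral search. momentum=0 gives the plain update;
    momentum>0 evaluates the update at the look-ahead point u + momentum * v
    (Nesterov-style), treating the update direction as a gradient step."""
    rng = np.random.default_rng(seed)
    u = np.zeros(horizon)
    v = np.zeros(horizon)
    history = [rollout_cost(u)]
    for _ in range(n_iters):
        delta = pi_step(u + momentum * v, rng)
        v = momentum * v + delta
        u = u + v
        history.append(rollout_cost(u))
    return u, np.array(history)

if __name__ == "__main__":
    _, plain = optimize(momentum=0.0)
    _, accelerated = optimize(momentum=0.5)
    print("plain final cost:      ", plain[-1])
    print("accelerated final cost:", accelerated[-1])
```

Comparing the two cost histories on this toy problem only illustrates the mechanism; it says nothing about the convergence gains reported in the paper, which depend on the systems and settings used there.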
