首页> 外文会议>Brazilian Conference on Intelligent Systems >On the Performance of Planning Through Backpropagation
【24h】

On the Performance of Planning Through Backpropagation

机译:论规划贯彻横向计算

获取原文

摘要

Planning problems with continuous state and action spaces are difficult to solve with existing planning techniques, specially when the state transition is defined by a high-dimension non-linear dynamics. Recently, a technique called Planning through Backpropagation (PtB) was introduced as an efficient and scalable alternative to traditional optimization-based methods for continuous planning problems. PtB leverages modern gradient descent algorithms and highly optimized automatic differentiation libraries to obtain approximate solutions. However, to date there have been no empirical evaluations comparing PtB with Linear-Quadratic (LQ) control problems. In this work, we compare PtB with an optimal algorithm from control theory called LQR, and its iterative version iLQR, when solving linear and non-linear continuous deterministic planning problems. The empirical results suggest that PtB can be an efficient alternative to optimizing non-linear continuous deterministic planning, being much easier to be implemented and stabilized than classical model-predictive control methods.
机译:使用连续状态和动作空间的规划问题很难利用现有的规划技术来解决,特别是当状态转换由高维非线性动态定义时。最近,通过BackProjagation(PTB)称为规划的技术被引入了一种基于传统优化的方法的有效和可扩展的替代方法,以进行持续规划问题。 PTB利用现代梯度下降算法和高度优化的自动差异化库来获得近似解决方案。然而,迄今为止,没有与线性二次(LQ)控制问题进行比较PTB的实证评估。在这项工作中,我们在求解线性和非线性连续确定性规划问题时,将PTB与名为LQR的控制理论的最佳算法及其迭代版ILQR进行比较。经验结果表明,PTB可以是优化非线性连续确定性规划的有效替代方案,比经典模型预测控制方法更容易实现和稳定。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号