首页> 外文会议>IEEE Conference on Control Technology and Applications >Random-Sampling Multipath Hypothesis Propagation for Cost Approximation in Long-Horizon Optimal Control
【24h】

Random-Sampling Multipath Hypothesis Propagation for Cost Approximation in Long-Horizon Optimal Control

机译:长期最优控制中成本近似的随机抽样多径假设传播

获取原文
获取外文期刊封面目录资料

摘要

In this paper, we develop a Monte-Carlo based heuristic approach to approximate the objective function in long horizon optimal control problems. In this approach, we evolve the system state over multiple trajectories into the future while sampling the noise disturbances at each time-step, and find the weighted average of the costs along all the trajectories. We call these methods random sampling - multipath hypothesis propagation or RS-MHP. These methods (or variants) exist in the literature; however, the literature lacks convergence results for a generic class of nonlinear systems. This paper fills this knowledge gap to a certain extent. We derive convergence results for the cost approximation error from the MHP methods and discuss their convergence (in probability) as the sample size increases. As a case study, we apply RS-MHP to approximate the cost function in a linear quadratic control problem and demonstrate the benefits of our approach against an existing and closely related approximation approach called nominal belief-state optimization.
机译:在本文中,我们开发了一种基于Monte-Carlo的启发式方法,以近似于长地平线最佳控制问题的目标函数。在这种方法中,我们在每个时间步骤中采样噪声干扰的同时将系统状态扩展到未来,并找到所有轨迹的成本的加权平均值。我们称这些方法随机采样 - 多径假设传播或RS-MHP。这些方法(或变体)存在于文献中;然而,文献缺乏通用类非线性系统的收敛结果。本文在一定程度上填补了这种知识差距。我们从MHP方法中获得成本近似误差的收敛结果,并随着样本大小的增加,讨论其收敛(以概率为单位)。作为一个案例研究,我们应用RS-MHP以近似线性二次控制问题的成本函数,并展示我们对现有和密切相关的近似方法的方法,称为标称信仰状态优化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号