首页> 外文期刊>IEEE Control Systems Letters >Random-Sampling Monte-Carlo Tree Search Methods for Cost Approximation in Long-Horizon Optimal Control
【24h】

Random-Sampling Monte-Carlo Tree Search Methods for Cost Approximation in Long-Horizon Optimal Control

机译:随机采样Monte-Carlo树搜索用于长地平线最佳控制的成本近似的方法

获取原文
获取原文并翻译 | 示例
           

摘要

We develop Monte-Carlo based heuristic approaches to approximate the objective function in long horizon optimal control problems. In these approaches, to approximate the expectation operator in the objective function, we evolve the system state over multiple trajectories into the future while sampling the noise disturbances at each time-step, and find the average (or weighted average) of the costs along all the trajectories. We call these methods random sampling - multipath hypothesis propagation or RS-MHP. These methods (or variants) exist in the literature; however, the literature lacks results on how well these approximation strategies converge. This letter fills this knowledge gap to a certain extent. We derive stochastic convergence results for the cost approximation error from the RS-MHP methods and discuss their convergence (in probability) as the sample size increases. We consider two case studies to demonstrate the effectiveness of our methods - a) linear quadratic control problem; b) unmanned aerial vehicle path optimization problem.
机译:我们开发了基于Monte-Carlo的启发式方法,以近似长地平线最佳控制问题的客观函数。在这些方法中,为了近似期望运营商在客观函数中,我们在每个时间步骤中对噪声干扰进行采样时,将系统状态扩展到未来,并找到所有成本的平均(或加权平均值)轨迹。我们调用这些方法<斜体XMLNS:mml =“http://www.w3.org/1998/math/mathml”xmlns:xlink =“http://www.w3.org/1999/xlink”>随机抽样 - 多径假设传播或RS-MHP。这些方法(或变体)存在于文献中;但是,文献缺乏这些近似策略汇聚的结果。这封信在一定程度上填补了这种知识差距。我们从RS-MHP方法的成本近似误差导出随机收敛结果,并随着样本大小的增加,讨论它们的收敛(以概率为单位)。我们考虑了两种案例研究,以证明我们的方法 - a)线性二次控制问题的有效性; b)无人驾驶飞行器路径优化问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号