Random-Sampling Multipath Hypothesis Propagation for Cost Approximation in Long-Horizon Optimal Control


Abstract

In this paper, we develop a Monte Carlo-based heuristic approach to approximate the objective function in long-horizon optimal control problems. In this approach, we evolve the system state over multiple trajectories into the future while sampling the noise disturbances at each time step, and compute the weighted average of the costs along all the trajectories. We call these methods random-sampling multipath hypothesis propagation, or RS-MHP. These methods (or variants) exist in the literature; however, the literature lacks convergence results for a generic class of nonlinear systems. This paper fills this knowledge gap to a certain extent. We derive convergence results for the cost approximation error of the MHP methods and discuss their convergence (in probability) as the sample size increases. As a case study, we apply RS-MHP to approximate the cost function in a linear quadratic control problem and demonstrate the benefits of our approach against an existing and closely related approximation approach called nominal belief-state optimization.
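The sampling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. Everything specific in it is an assumption made for illustration: the scalar linear system x_{k+1} = a*x_k + b*u_k + w_k, the quadratic stage cost q*x^2 + r*u^2, zero-mean Gaussian noise, a fixed open-loop control sequence, uniform path weights (so the weighted average reduces to a plain mean), and the function names `rs_mhp_cost` and `exact_cost`. The closed-form expected cost is included only as a reference value for the Monte Carlo estimate.

```python
import random


def rs_mhp_cost(x0, controls, a, b, q, r, noise_std, num_paths, seed=0):
    """Monte Carlo estimate of the expected cumulative quadratic cost.

    Assumed (hypothetical) model: x_next = a*x + b*u + w, w ~ N(0, noise_std^2).
    Every sampled trajectory gets the same weight, which is an assumption.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_paths):
        x = float(x0)
        path_cost = 0.0
        for u in controls:
            # Stage cost at the current state, then propagate with sampled noise.
            path_cost += q * x * x + r * u * u
            x = a * x + b * u + rng.gauss(0.0, noise_std)
        path_cost += q * x * x  # terminal cost (assumed to reuse q)
        total += path_cost
    return total / num_paths


def exact_cost(x0, controls, a, b, q, r, noise_std):
    """Closed-form expected cost for the same assumed scalar model.

    With open-loop controls, E[x^2] = mean^2 + variance, and both the mean
    and the variance follow simple linear recursions.
    """
    mean, var = float(x0), 0.0
    total = 0.0
    for u in controls:
        total += q * (mean * mean + var) + r * u * u
        mean = a * mean + b * u
        var = a * a * var + noise_std ** 2
    total += q * (mean * mean + var)
    return total


if __name__ == "__main__":
    controls = [-0.1] * 20  # arbitrary illustrative control sequence
    for n in (100, 1000, 10000):
        estimate = rs_mhp_cost(1.0, controls, 0.9, 0.5, 1.0, 0.1, 0.5, n)
        print(n, estimate)
    print("exact", exact_cost(1.0, controls, 0.9, 0.5, 1.0, 0.1, 0.5))
```

Running the script prints estimates for increasing sample sizes next to the closed-form value, which gives an informal feel for the convergence in sample size that the abstract refers to. It does not reproduce the paper's results or its comparison with nominal belief-state optimization.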


Bibliographic details

Source: IEEE Conference on Control Technology and Applications, 2020, pp. 14-18 (5 pages)
Authors: Shankarachary Ragi; Hans D. Mittelmann
Original format: PDF
Keywords: Trajectory; Optimal control; Linear programming; Convergence; Approximation error; Cost function

Similar literature

1. Random-Sampling Monte-Carlo Tree Search Methods for Cost Approximation in Long-Horizon Optimal Control [J]. Shankarachary Ragi, Hans D. Mittelmann. IEEE Control Systems Letters, 2021, no. 5.
2. Costate approximation in optimal control using integral Gaussian quadrature orthogonal collocation methods [J]. Francolin Camila C., Benson David A., Hager William W. Optimal Control Applications and Methods, 2015, no. 4.
3. Optimal Control by Direct Approximation of the Gradient of the Cost-to-Go [J]. Douglas B. Tweed. Control and Intelligent Systems, 2013, no. 1.
4. Finite approximation of the optimal average cost for a class of stochastic control systems [C]. Alessandro N. Vargas, Joao B. R. do Val. IFAC World Congress, 2011.
5. Analysis and approximations of terminal-state tracking optimal control problems and controllability problems constrained by linear and semilinear parabolic partial differential equations [D]. Kwon, Hee-Dae. 2003.
6. Cyber Risk Propagation and Optimal Selection of Cybersecurity Controls for Complex Cyberphysical Systems [O]. Georgios Kavallieratos, Georgios Spathoulas, Sokratis Katsikas. 2021.
7. Costate Approximation in Optimal Control Using Integral Gaussian Quadrature Orthogonal Collocation Methods [O]. Camila C. Françolin, David A. Benson, William W. Hager. 2015.
8. Approximations and Optimal Control for the Pathwise Average Cost per Unit Time and Discounted Problems for Wideband Noise Driven Systems [R]. Kushner, H. J. 1988.
