Random-Sampling Monte-Carlo Tree Search Methods for Cost Approximation in Long-Horizon Optimal Control

Shankarachary Ragi; Hans D. Mittelmann

首页> 外文期刊>IEEE Control Systems Letters >Random-Sampling Monte-Carlo Tree Search Methods for Cost Approximation in Long-Horizon Optimal Control

【24h】

Random-Sampling Monte-Carlo Tree Search Methods for Cost Approximation in Long-Horizon Optimal Control

机译：随机采样Monte-Carlo树搜索用于长地平线最佳控制的成本近似的方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We develop Monte-Carlo based heuristic approaches to approximate the objective function in long horizon optimal control problems. In these approaches, to approximate the expectation operator in the objective function, we evolve the system state over multiple trajectories into the future while sampling the noise disturbances at each time-step, and find the average (or weighted average) of the costs along all the trajectories. We call these methods random sampling - multipath hypothesis propagation or RS-MHP. These methods (or variants) exist in the literature; however, the literature lacks results on how well these approximation strategies converge. This letter fills this knowledge gap to a certain extent. We derive stochastic convergence results for the cost approximation error from the RS-MHP methods and discuss their convergence (in probability) as the sample size increases. We consider two case studies to demonstrate the effectiveness of our methods - a) linear quadratic control problem; b) unmanned aerial vehicle path optimization problem.

机译：我们开发了基于Monte-Carlo的启发式方法，以近似长地平线最佳控制问题的客观函数。在这些方法中，为了近似期望运营商在客观函数中，我们在每个时间步骤中对噪声干扰进行采样时，将系统状态扩展到未来，并找到所有成本的平均（或加权平均值）轨迹。我们调用这些方法<斜体XMLNS：mml =“http://www.w3.org/1998/math/mathml”xmlns：xlink =“http://www.w3.org/1999/xlink”>随机抽样 - 多径假设传播或RS-MHP。这些方法（或变体）存在于文献中;但是，文献缺乏这些近似策略汇聚的结果。这封信在一定程度上填补了这种知识差距。我们从RS-MHP方法的成本近似误差导出随机收敛结果，并随着样本大小的增加，讨论它们的收敛（以概率为单位）。我们考虑了两种案例研究，以证明我们的方法 - a）线性二次控制问题的有效性; b）无人驾驶飞行器路径优化问题。

著录项

来源
《IEEE Control Systems Letters》 |2021年第5期|1759-1764|共6页
作者
Shankarachary Ragi; Hans D. Mittelmann;
展开▼
作者单位

South Dakota School of Mines and Technology Rapid City SD USA;

School of Mathematical and Statistical Sciences Arizona State University Tempe AZ USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Trajectory; Licenses; Optimal control; Convergence; Random variables; Monte Carlo methods; Large Hadron Collider;

机译：轨迹;许可证;最优控制;融合;随机变量;蒙特卡罗方法;大型强子撞机;

相似文献

外文文献
中文文献
专利

1. Pareto-Optimal Transit Route Planning With Multi-Objective Monte-Carlo Tree Search [J] . Weng Di, Chen Ran, Zhang Jianhui, IEEE Transactions on Intelligent Transportation Systems . 2021,第2期

机译：帕累托 - 多目标Monte-Carlo树搜索的最佳运输路线规划
2. Costate approximation in optimal control using integral Gaussian quadrature orthogonal collocation methods [J] . Francolin Camila C., Benson David A., Hager William W., Optimal Control Applications and Methods . 2015,第4期

机译：使用积分高斯正交正交配置法的最优控制中的共态逼近。
3. A Monte-Carlo Tree Search based Tracking Control Approach for Timed Petri Nets [J] . Raphael Fritz, Nico Krebs, Ping Zhang IFAC PapersOnLine . 2020,第2期

机译：基于Monte-Carlo树搜索定时Petri网的跟踪控制方法
4. Random-Sampling Multipath Hypothesis Propagation for Cost Approximation in Long-Horizon Optimal Control [C] . Shankarachary Ragi, Hans D. Mittelmann IEEE Conference on Control Technology and Applications . 2020

机译：长期最优控制中成本近似的随机抽样多径假设传播
5. Wavelet-based finite element methods for approximations of optimal controls for distributed parameter systems [D] . Ko, Jeonghwan. 1996

机译：基于小波的有限元方法，用于分布式参数系统的最佳控制近似
6. A stochastic approximation algorithm with Markov chain Monte-Carlo method for incomplete data estimation problems [O] . Ming Gao Gu, Fan Hui Kong 1998

机译：马尔可夫链蒙特卡洛方法的不完全估计问题的随机近似算法
7. Costate Approximation in Optimal Control Using Integral Gaussian Quadrature Orthogonal Collocation Methods [O] . Camila C. Françolin, David A. Benson, William W. Hager, 2015

机译：基于积分高斯求积法的最优控制中的Costate逼近
8. Approximations and Computational Methods for Optimal Stopping and Stochastic Impulsive Control Problems, [R] . kushner,harold j. 1975

机译：最优停止和随机脉冲控制问题的近似和计算方法，

Random-Sampling Monte-Carlo Tree Search Methods for Cost Approximation in Long-Horizon Optimal Control

摘要

著录项

相似文献

相关主题

期刊订阅