POLICY ITERATION ALGORITHM FOR SINGULAR CONTROLLED DIFFUSION PROCESSES?

YUAN-HUA NI; HAI-TAO FANG

首页> 外文期刊>SIAM Journal on Control and Optimization >POLICY ITERATION ALGORITHM FOR SINGULAR CONTROLLED DIFFUSION PROCESSES?

【24h】

POLICY ITERATION ALGORITHM FOR SINGULAR CONTROLLED DIFFUSION PROCESSES?

机译：奇异控制扩散过程的策略迭代算法？

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, the infinite horizon optimal control problems for singular diffusion processes are considered from the viewpoints of Markov decision processes and perturbation analysis, where the singularity of diffusion means that the covariance matrix of the system noise is allowed to be degenerate. A formula of performance difference under two different controls is derived and leads to a comparison theorem. By the comparison theorem, starting from a control, a so-called better control can be selected. Therefore, a control policy iteration algorithm is developed, by which the performance improves step by step and converges to the optimal one. When this applies to the stochastic affine nonlinear regulator and stochastic linear quadratic optimal control problems, better control can be constructed in a closed form. It is also shown that when the considered stochastic systems degenerate to the deterministic ones, the proposed algorithm reduces to the adaptive dynamic programming algorithm [J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, Adaptive dynamic programming, IEEE Trans. Systems Man Cybernet., 32 (2002), pp. 140-153] for the affine nonlinear systems and to the well-known Kleinman algorithm [D. L. Kleinman, On an iterative technique for Riccati equation computation, IEEE Trans. Automat. Control, 13 (1968), pp. 114-115] for the linear quadratic optimal control problem.

机译：本文从马尔可夫决策过程和扰动分析的角度出发，考虑了奇异扩散过程的无限视界最优控制问题，其中奇异扩散意味着可以简化系统噪声的协方差矩阵。推导了两种不同控制下的性能差异公式，并得出比较定理。通过比较定理，从控制开始，可以选择所谓的更好的控制。因此，开发了一种控制策略迭代算法，使算法性能逐步提高并收敛到最优值。当这适用于随机仿射非线性调节器和随机线性二次最优控制问题时，可以以封闭形式构造更好的控制。研究还表明，当所考虑的随机系统退化为确定性系统时，所提出的算法简化为自适应动态规划算法[J． J. Murray，C。J. Cox，G。G. Lendaris和R. Saeks，自适应动态编程，IEEE Trans。 Systems Man Cybernet。，32（2002），pp。140-153]仿射非线性系统和著名的Kleinman算法[D. L. Kleinman，关于Riccati方程计算的迭代技术，IEEE Trans。自动机Control，13（1968），pp。114-115]。

著录项

来源
《SIAM Journal on Control and Optimization》 |2013年第5期|共19页
作者
YUAN-HUA NI; HAI-TAO FANG;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类运筹学;控制论、信息论（数学理论）;
关键词
perturbation analysis; Markov decision process; policy iteration algorithm; stochastic optimal control; singular diffusion processes;

机译：摄动分析;马尔可夫决策过程;策略迭代算法;随机最优控制;奇异扩散过程;

相似文献

外文文献
中文文献
专利

1. POLICY ITERATION ALGORITHM FOR SINGULAR CONTROLLED DIFFUSION PROCESSES? [J] . YUAN-HUA NI, HAI-TAO FANG SIAM Journal on Control and Optimization . 2013,第5期

机译：奇异控制扩散过程的策略迭代算法？
2. The policy iteration algorithm for average continuous control of piecewise deterministic Markov processes [J] . Costa O.L.V., Dufour F. Applied mathematics and optimization . 2010,第2期

机译：分段确定性马尔可夫过程平均连续控制的策略迭代算法
3. The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes [J] . O. L. V. Costa, F. Dufour Applied Mathematics & Optimization . 2010,第2期

机译：分段确定性马尔可夫过程平均连续控制的策略迭代算法
4. Online Policy Iteration Algorithm for Semi-Markov Switching State-Space Control Processes [C] . Qi Jiang, Hong-Sheng Xi, Bao-Qin Yin IEEE Conference on Decision and Control . 2009

机译：半马尔可夫切换状态空间控制过程的在线策略迭代算法
5. Optimization problems for diffusion processes. Some aspects of singular stochastic control and minimum relative entropy calibration. [D] . Kruk, Lukasz. 1999

机译：扩散过程的优化问题。奇异随机控制和最小相对熵校准的某些方面。
6. Optimisation of CT protocols in PET-CT across different scanner models using different automatic exposure control methods and iterative reconstruction algorithms [O] . Sarah-May Gould, Jane Mackewn, Sugama Chicklore, 2021

机译：不同自动曝光控制方法和迭代重建算法不同扫描仪模型中PET-CT中CT协议的优化
7. The policy iteration algorithm for average continuous control of piecewise deterministic Markov processes [O] . Costa, Oswaldo, Dufour, François 2010

机译：分段确定性马尔可夫过程平均连续控制的策略迭代算法

POLICY ITERATION ALGORITHM FOR SINGULAR CONTROLLED DIFFUSION PROCESSES?

摘要

著录项

相似文献

相关主题

期刊订阅