首页> 外文期刊>SIAM Journal on Control and Optimization >POLICY ITERATION ALGORITHM FOR SINGULAR CONTROLLED DIFFUSION PROCESSES?
【24h】

POLICY ITERATION ALGORITHM FOR SINGULAR CONTROLLED DIFFUSION PROCESSES?

机译:奇异控制扩散过程的策略迭代算法?

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, the infinite horizon optimal control problems for singular diffusion processes are considered from the viewpoints of Markov decision processes and perturbation analysis, where the singularity of diffusion means that the covariance matrix of the system noise is allowed to be degenerate. A formula of performance difference under two different controls is derived and leads to a comparison theorem. By the comparison theorem, starting from a control, a so-called better control can be selected. Therefore, a control policy iteration algorithm is developed, by which the performance improves step by step and converges to the optimal one. When this applies to the stochastic affine nonlinear regulator and stochastic linear quadratic optimal control problems, better control can be constructed in a closed form. It is also shown that when the considered stochastic systems degenerate to the deterministic ones, the proposed algorithm reduces to the adaptive dynamic programming algorithm [J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, Adaptive dynamic programming, IEEE Trans. Systems Man Cybernet., 32 (2002), pp. 140-153] for the affine nonlinear systems and to the well-known Kleinman algorithm [D. L. Kleinman, On an iterative technique for Riccati equation computation, IEEE Trans. Automat. Control, 13 (1968), pp. 114-115] for the linear quadratic optimal control problem.
机译:本文从马尔可夫决策过程和扰动分析的角度出发,考虑了奇异扩散过程的无限视界最优控制问题,其中奇异扩散意味着可以简化系统噪声的协方差矩阵。推导了两种不同控制下的性能差异公式,并得出比较定理。通过比较定理,从控制开始,可以选择所谓的更好的控制。因此,开发了一种控制策略迭代算法,使算法性能逐步提高并收敛到最优值。当这适用于随机仿射非线性调节器和随机线性二次最优控制问题时,可以以封闭形式构造更好的控制。研究还表明,当所考虑的随机系统退化为确定性系统时,所提出的算法简化为自适应动态规划算法[J. J. Murray,C。J. Cox,G。G. Lendaris和R. Saeks,自适应动态编程,IEEE Trans。 Systems Man Cyber​​net。,32(2002),pp。140-153]仿射非线性系统和著名的Kleinman算法[D. L. Kleinman,关于Riccati方程计算的迭代技术,IEEE Trans。自动机Control,13(1968),pp。114-115]。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号