首页> 外文期刊>SIAM Journal on Control and Optimization >RATES OF CONVERGENCE OF NUMERICAL METHODS FOR CONTROLLED REGIME-SWITCHING DIFFUSIONS WITH STOPPING TIMES IN THE COSTS
【24h】

RATES OF CONVERGENCE OF NUMERICAL METHODS FOR CONTROLLED REGIME-SWITCHING DIFFUSIONS WITH STOPPING TIMES IN THE COSTS

机译:成本中具有停止时间的受控系统切换扩散的数值方法的收敛速度

获取原文
获取原文并翻译 | 示例
           

摘要

This work is concerned with rates of convergence of Markov chain approximation methods for controlled switching diffusions. The cost function is defined on an infinite horizon with stopping times and without discount. Displaying both continuous dynamics and discrete events, the discrete events are modeled by continuous-time Markov chains to delineate a random environment and other random factors that cannot be represented by diffusion processes. This paper presents a first attempt using a probabilistic approach to treat such rates of convergence problems. In addition, in contrast to the significant developments in the literature using partial differential equation (PDE) methods for the approximation of controlled diffusions, there do not yet appear to be any PDE results to date for rates of convergence of numerical solutions for controlled switching diffusions, to the best of our knowledge. Although some of the working conditions in this paper such as the one-dimensional continuous state variable, nondegenerate diffusions, and control only on the drift may be seemingly strong, they are adequate as the starting point for using this new approach to treat the rates of convergence problems. Moreover, in the literature, to prove the convergence using Markov chain approximation methods for control problems involving cost functions with stopping (even for uncontrolled diffusion without switching), an added assumption was used to avoid the so-called tangency problem. As a by-product of our approach, by modifying the value function, it is demonstrated that the anticipated tangency problem will not arise in the sense of convergence in probability and in the sense of L-1.
机译:这项工作涉及用于控制开关扩散的马尔可夫链近似方法的收敛速度。成本函数是在具有停止时间且无折扣的无限范围内定义的。同时显示连续动态和离散事件,通过连续时间马尔可夫链对离散事件进行建模,以描绘随机环境和其他无法由扩散过程表示的随机因素。本文提出了使用概率方法来解决此类收敛速度问题的首次尝试。此外,与文献中使用偏微分方程(PDE)方法逼近受控扩散的重大进展相比,迄今为止,对于受控切换扩散的数值解的收敛速度,似乎还没有任何PDE结果。 ,据我们所知。尽管本文中的某些工作条件(如一维连续状态变量,非退化扩散以及仅对漂移的控制)似乎看似很强,但它们足以作为使用这种新方法来处理速率的起点。收敛问题。此外,在文献中,为了证明使用马尔可夫链逼近方法对涉及具有停止功能的成本函数的控制问题(即使是无切换的非受控扩散)的收敛性,使用了一个附加假设来避免所谓的相切问题。作为我们方法的副产品,通过修改值函数,证明在概率收敛和L-1的意义上不会出现预期的相切问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号