...
首页> 外文期刊>SIAM Journal on Control and Optimization >Numerical approximations for stochastic differential games
【24h】

Numerical approximations for stochastic differential games

机译:随机微分对策的数值逼近

获取原文
获取原文并翻译 | 示例

摘要

The Markov chain approximation method is a widely used, robust, relatively easy to use, and efficient family of methods for the bulk of stochastic control problems in continuous time for reflected-jump-diffusion-type models. It has been shown to converge under broad conditions, and there are good algorithms for solving the numerical problems if the dimension is not too high. Versions of these methods have been used in applications to various two-player differential and stochastic dynamic games for a long time, and proofs of convergence are available for some cases, mainly using PDE-type techniques. In this paper, purely probabilistic proofs of convergence are given for a broad class of such problems, where the controls for the two players are separated in the dynamics and cost function, and which cover a substantial class not dealt with in previous works. Discounted and stopping time cost functions are considered. Finite horizon problems and problems where the process is stopped on first hitting an a priori given boundary can be dealt with by adapting the methods of [H. J. Kushner and P. Dupuis, Numerical Methods for Stochastic Control Problems, in Continuous Time, 2nd ed., Springer-Verlag, Berlin, New York, 2001] as done in this paper for the treated problems. The essential conditions are the weak-sense existence and uniqueness of solutions, an almost everywhere continuity condition, and that a weak local consistency condition holds almost everywhere for the numerical approximations, just as for the control problem. There are extensions to problems with controlled variance and jumps. [References: 41]
机译:马尔可夫链近似法是一种广泛使用的,健壮的,相对易于使用的,有效的方法族,用于反射跳跃扩散型模型在连续时间内的大量随机控制问题。已经证明它可以在宽泛的条件下收敛,并且如果维数不太高,则有很好的算法可以解决数值问题。这些方法的版本已长期用于各种两人差分和随机动态游戏中,并且在某些情况下(主要是使用PDE类型的技术)可获得收敛证明。在本文中,针对此类问题的广泛类给出了收敛的纯概率证明,其中两个参与者的控制在动力学和成本函数中是分开的,并且涵盖了以前工作中未涉及的实质类。考虑折扣和停工时间成本函数。有限的层级问题和在首先达到先验给定边界时停止过程的问题可以通过采用[H. J. Kushner和P. Dupuis,《随机控制问题的数值方法,连续时间》,第二版,Springer-Verlag,柏林,纽约,2001年,本文针对已解决的问题进行了处理。基本条件是弱解的存在性和唯一性,几乎处处都是连续性的条件,以及弱局部一致性条件对于数值逼近几乎处处都存在,就像控制问题一样。扩展了具有受控方差和跳跃的问题。 [参考:41]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号