首页> 外文期刊>Operations Research Letters: A Journal of the Operations Research Society of America >Approximation of two-person zero-sum continuous-time Markov games with average payoff criterion
【24h】

Approximation of two-person zero-sum continuous-time Markov games with average payoff criterion

机译:具有平均收益标准的两人零和连续时间马尔可夫游戏的逼近

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

We consider a two-person zero-sum continuous-time Markov game g with denumerable state space, Sorel action spaces, unbounded payoff and transition rates, under the long-run expected average payoff criterion. To approximate numerically the value of 9, we construct finite state and actions game models g(n) whose value functions converge to the value of g. Rates of convergence are given. We propose a policy iteration algorithm for the finite state and actions games g(n). We show an application to a population system. (C) 2014 Elsevier B.V. All rights reserved.
机译:在长期预期平均收益标准下,我们考虑了具有可数状态空间,Sorel动作空间,无穷收益和过渡率的两人零和连续时间马尔可夫博弈g。为了用数字近似9的值,我们构造了有限状态和动作博弈模型g(n),其值函数收敛到g的值。给出了收敛速度。我们为有限状态和动作博弈g(n)提出了一种策略迭代算法。我们展示了对人口系统的应用。 (C)2014 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号