...
首页> 外文期刊>Kybernetika >NASH ∈-EQUILIBRIA FOR STOCHASTIC GAMES WITH TOTAL REWARD FUNCTIONS: AN APPROACH THROUGH MARKOV DECISION PROCESSES
【24h】

NASH ∈-EQUILIBRIA FOR STOCHASTIC GAMES WITH TOTAL REWARD FUNCTIONS: AN APPROACH THROUGH MARKOV DECISION PROCESSES

机译:具有总奖励功能的随机游戏的NASH∈平衡:一种基于马尔可夫决策过程的方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The main objective of this paper is to find structural conditions under which a stochastic game between two players with total reward functions has an epsilon-equilibrium. To reach this goal, the results of Markov decision processes are used to find epsilon-optimal strategies for each player and then the correspondence of a better answer as well as a more general version of Kakutani's Fixed Point Theorem to obtain the epsilon-equilibrium mentioned. Moreover, two examples to illustrate the theory developed are presented.
机译:本文的主要目的是找到结构条件,在该结构条件下,具有总奖励函数的两个参与者之间的随机博弈具有ε平衡。为了达到这个目标,使用马尔可夫决策过程的结果来找到每个玩家的ε最优策略,然后找到一个更好的答案以及更普遍的角谷定点定理的对应关系,以获得所提到的ε平衡。此外,还提供了两个例子来说明所发展的理论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号