NASH ∈-EQUILIBRIA FOR STOCHASTIC GAMES WITH TOTAL REWARD FUNCTIONS: AN APPROACH THROUGH MARKOV DECISION PROCESSES

Gonzalez-Padilla Francisco J.; Montes-de-Oca Raul

首页> 外文期刊>Kybernetika >NASH ∈-EQUILIBRIA FOR STOCHASTIC GAMES WITH TOTAL REWARD FUNCTIONS: AN APPROACH THROUGH MARKOV DECISION PROCESSES

【24h】

NASH ∈-EQUILIBRIA FOR STOCHASTIC GAMES WITH TOTAL REWARD FUNCTIONS: AN APPROACH THROUGH MARKOV DECISION PROCESSES

机译：具有总奖励功能的随机游戏的NASH∈平衡：一种基于马尔可夫决策过程的方法

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The main objective of this paper is to find structural conditions under which a stochastic game between two players with total reward functions has an epsilon-equilibrium. To reach this goal, the results of Markov decision processes are used to find epsilon-optimal strategies for each player and then the correspondence of a better answer as well as a more general version of Kakutani's Fixed Point Theorem to obtain the epsilon-equilibrium mentioned. Moreover, two examples to illustrate the theory developed are presented.

机译：本文的主要目的是找到结构条件，在该结构条件下，具有总奖励函数的两个参与者之间的随机博弈具有ε平衡。为了达到这个目标，使用马尔可夫决策过程的结果来找到每个玩家的ε最优策略，然后找到一个更好的答案以及更普遍的角谷定点定理的对应关系，以获得所提到的ε平衡。此外，还提供了两个例子来说明所发展的理论。

著录项

来源
《Kybernetika》 |2019年第1期|152-165|共14页
作者
Gonzalez-Padilla Francisco J.; Montes-de-Oca Raul;
展开▼
作者单位

Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Ave San Rafael Atlixco 186, Mexico City 09340, DF, Mexico;

Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Ave San Rafael Atlixco 186, Mexico City 09340, DF, Mexico;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
stochastic games; Nash equilibrium; Markov decision processes; total rewards;

机译：随机博弈;纳什均衡;马尔可夫决策过程;总报酬;

相似文献

外文文献
中文文献
专利

1. NASH ∈-EQUILIBRIA FOR STOCHASTIC GAMES WITH TOTAL REWARD FUNCTIONS: AN APPROACH THROUGH MARKOV DECISION PROCESSES [J] . Gonzalez-Padilla Francisco J., Montes-de-Oca Raul Kybernetika . 2019,第1期

机译：纳什∈平衡用于随机游戏的总奖励功能：通过马尔可夫决策过程的方法
2. The game-theoretical approach to Markov decision problems and determining Nash equilibria for stochastic positional games [J] . Dmitrii Lozovanu International Journal of Mathematical Modelling and Numerical Optimisation . 2011,第2期

机译：马尔可夫决策问题的博弈论方法和确定随机位置博弈的纳什均衡
3. Markov Decision Processes and Stochastic Games with Total Effective Payoff [J] . Endre Boros, Khaled Elbassioni, Vladimir Gurvich, LIPIcs : Leibniz International Proceedings in Informatics . 2015,第2期

机译：总有效收益的马尔可夫决策过程和随机博弈
4. Markov Decision Processes and Determining Nash Equilibria for Stochastic Positional Games [C] . Dmitrii Lozovanu, Stefan Pickl, Erik Kropat IFAC World Congress . 2011

机译：马尔可夫决策过程和确定随机位置游戏的纳什均衡
5. Two new computer based results in game theory related to combinatorial games and Nash equilibria. [D] . Oudalov, Vladimir. 2013

机译：博弈论中两个新的基于计算机的结果与组合博弈和纳什均衡有关。
6. Spike-based Decision Learning of Nash Equilibria in Two-Player Games [O] . Johannes Friedrich, Walter Senn 2012

机译：两人游戏中基于纳什均衡的基于峰值的决策学习
7. A Convex Programming Approach for Discrete-Time Markov Decision Processes under the Expected Total Reward Criterion [O] . François Dufour, Alexandre Genadot 2020

机译：在预期总奖励标准下的离散时间马尔可夫决策过程的凸编程方法
8. Shift-Function Approach for Markov Decision Processes with Unbounded Returns [R] . Stidham, S. , Van Nunen, J. 1981

机译：具有无界收益的马尔可夫决策过程的移位函数方法

NASH ∈-EQUILIBRIA FOR STOCHASTIC GAMES WITH TOTAL REWARD FUNCTIONS: AN APPROACH THROUGH MARKOV DECISION PROCESSES

摘要

著录项

相似文献

相关主题

期刊订阅