Recursive Markov Decision Processes and Recursive Stochastic Games

机译：递归马尔可夫决策过程和递归随机博弈

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We introduce Recursive Markov Decision Processes (RMDPs) and Recursive Simple Stochastic Games (RSSGs), and study the decidability and complexity of algorithms for their analysis and verification. These models extend Recursive Markov Chains (RMCs), introduced in [EY05a, EY05b] as a natural model for verification of probabilistic procedural programs and related systems involving both recursion and probabilistic behavior. RMCs define a class of denumerable Markov chains with a rich theory generalizing that of stochastic context-free grammars and multi-type branching processes, and they are also intimately related to probabilistic pushdown systems. RMDPs & RSSGs extend RMCs with one controller or two adversarial players, respectively. Such extensions are useful for modeling nondeterministic and concurrent behavior, as well as modeling a system's interactions with an environment. We provide upper and lower bounds for deciding, given an RMDP (or RSSG) A and probability p, whether player 1 has a strategy to force termination at a desired exit with probability at least p. We also address "qualitative" termination, where p = 1, and model checking questions.

机译：我们介绍了递归马尔可夫决策过程（RMDP）和递归简单随机博弈（RSSG），并研究了算法的可判定性和复杂性，以进行分析和验证。这些模型扩展了在[EY05a，EY05b]中引入的递归马尔可夫链（RMC），作为验证概率过程程序和涉及递归和概率行为的相关系统的自然模型。 RMC用丰富的理论定义了一类可数的马尔可夫链，该理论概括了随机上下文无关文法和多类型分支过程的原理，并且它们也与概率下推系统密切相关。 RMDP和RSSG分别用一个控制者或两个对抗者来扩展RMC。这种扩展对于建模不确定性和并发行为以及对系统与环境的交互进行建模很有用。在给定RMDP（或RSSG）A和概率p的情况下，我们提供了上限和下限，用于确定玩家1是否具有以至少p的概率强制终止在所需出口处的策略。我们还将解决“定性”终止（其中p = 1）和模型检查问题。

著录项

来源
《International Colloquium on Automata, Languages and Programming(ICALP 2005); 20050711-15; Lisbon(PT)》|2005年|P.891-903|共13页
会议地点 Lisbon(PT)
作者
Kousha Etessami; Mihalis Yannakakis;
展开▼
作者单位

LFCS, School of Informatics, University of Edinburgh;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类程序语言、算法语言;
关键词

相似文献

外文文献
中文文献
专利

1. Recursive Markov Decision Processes and Recursive Stochastic Games [J] . KOUSHA ETESSAMI, MIHALIS YANNAKAKIS Journal of the Association for Computing Machinery . 2015,第2期

机译：递归马尔可夫决策过程和递归随机博弈
2. Modeling Human Recursive Reasoning Using Empirically Informed Interactive Partially Observable Markov Decision Processes [J] . Doshi P., Qu X., Goodie A. S., Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on . 2012,第6期

机译：使用经验丰富的交互式部分可观察的马尔可夫决策过程对人类递归推理建模
3. Reachability in recursive Markov decision processes [J] . Tomas Brazdil, Vaclav Brozek, Vojtech Forejt, Information and computation . 2008,第5期

机译：递归马尔可夫决策过程中的可到达性
4. Recursive Markov Decision Processes and Recursive Stochastic Games [C] . International Colloquium on Automata, Languages and Programming . 2005

机译：递归马尔可夫决策过程和递归随机游戏
5. Problem Solving Markov Models and Recursive Pedagogy [D] . Abu Deeb, Fatima A. 2018

机译：马尔可夫模型问题解决与递归教学法
6. Recursive utility in a Markov environment with stochastic growth [O] . Lars Peter Hansen, José A. Scheinkman 2012

机译：随机增长的马尔可夫环境中的递归效用
7. Recursive Markov Decision Processes and Recursive Stochastic Games [O] . 2008

机译：递归马尔可夫决策过程和递归随机博弈
8. Recursive Estimation and Segmentation in Autoregressive Processes with MarkovRegime [R] . Thuvesholmen, M. 1994

机译：markovRegime自回归过程的递归估计与分割

Recursive Markov Decision Processes and Recursive Stochastic Games

摘要

著录项

相似文献

相关主题

期刊订阅