Recursive Markov Decision Processes and Recursive Stochastic Games

机译：递归马尔可夫决策过程和递归随机游戏

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We introduce Recursive Markov Decision Processes (RMDPs) and Recursive Simple Stochastic Games (RSSGs), and study the decidability and complexity of algorithms for their analysis and verification. These models extend Recursive Markov Chains (RMCs), introduced in [EY05a, EY05b] as a natural model for verification of probabilistic procedural programs and related systems involving both recursion and probabilistic behavior. RMCs define a class of denumerable Markov chains with a rich theory generalizing that of stochastic context-free grammars and multi-type branching processes, and they are also intimately related to probabilistic pushdown systems. RMDPs & RSSGs extend RMCs with one controller or two adversarial players, respectively. Such extensions are useful for modeling nondeterministic and concurrent behavior, as well as modeling a system's interactions with an environment. We provide upper and lower bounds for deciding, given an RMDP (or RSSG) A and probability p, whether player 1 has a strategy to force termination at a desired exit with probability at least p. We also address "qualitative" termination, where p = 1, and model checking questions.

机译：我们介绍递归马尔可夫决策过程（RMDPS）和递归简单随机游戏（RSSG），并研究算法的分析和验证算法的可解除性和复杂性。这些模型扩展了递归Markov链（RMC），以[EY05A，EY05B]作为自然模型，用于验证概率的程序程序和相关系统，涉及递归和概率行为。 RMCS定义一类可降价的马尔可夫链，具有丰富的理论，概括了随机上下文语法和多型分支过程的概括，它们与概率下推系统密切相关。 RMDPS＆RSSGS分别使用一个控制器或两个对抗运动员扩展RMC。这种扩展对于建模非识别和并发行为是有用的，以及建模系统与环境的交互。我们提供用于确定RMDP（或RSSG）A和概率P的播放器1的上下界限，无论播放器1是否具有以所需的出口终止概率至少p的策略。我们还解决了“定性”终止，其中P = 1，以及模型检查问题。

著录项

来源
《International Colloquium on Automata, Languages, and Programming》|2005年||共13页
会议地点
作者
Kousha Etessami; Mihalis Yannakakis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-532;
关键词

相似文献

外文文献
中文文献
专利

1. Recursive Markov Decision Processes and Recursive Stochastic Games [J] . KOUSHA ETESSAMI, MIHALIS YANNAKAKIS Journal of the Association for Computing Machinery . 2015,第2期

机译：递归马尔可夫决策过程和递归随机博弈
2. Modeling Human Recursive Reasoning Using Empirically Informed Interactive Partially Observable Markov Decision Processes [J] . Doshi P., Qu X., Goodie A. S., Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on . 2012,第6期

机译：使用经验丰富的交互式部分可观察的马尔可夫决策过程对人类递归推理建模
3. Reachability in recursive Markov decision processes [J] . Tomas Brazdil, Vaclav Brozek, Vojtech Forejt, Information and computation . 2008,第5期

机译：递归马尔可夫决策过程中的可到达性
4. Recursive Markov Decision Processes and Recursive Stochastic Games [C] . Kousha Etessami, Mihalis Yannakakis International Colloquium on Automata, Languages and Programming(ICALP 2005); 20050711-15; Lisbon(PT) . 2005

机译：递归马尔可夫决策过程和递归随机博弈
5. Problem Solving Markov Models and Recursive Pedagogy [D] . Abu Deeb, Fatima A. 2018

机译：马尔可夫模型问题解决与递归教学法
6. Recursive utility in a Markov environment with stochastic growth [O] . Lars Peter Hansen, José A. Scheinkman 2012

机译：随机增长的马尔可夫环境中的递归效用
7. Recursive Markov Decision Processes and Recursive Stochastic Games [O] . 2008

机译：递归马尔可夫决策过程和递归随机博弈
8. Recursive Estimation and Segmentation in Autoregressive Processes with MarkovRegime [R] . Thuvesholmen, M. 1994

机译：markovRegime自回归过程的递归估计与分割

Recursive Markov Decision Processes and Recursive Stochastic Games

摘要

著录项

相似文献

相关主题

期刊订阅