首页> 外文会议>European Conference on Artificial Intelligence >Path-Constrained Markov Decision Processes: bridging the gap between probabilistic model-checking and decision-theoretic planning

【24h】

Path-Constrained Markov Decision Processes: bridging the gap between probabilistic model-checking and decision-theoretic planning

机译：路径约束的马尔可夫决策过程：弥合概率模型 - 检查与决策定制规划之间的差距

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Markov Decision Processes (MDPs) are a popular model for planning under probabilistic uncertainties. The solution of an MDP is a policy represented as a controlled Markov chain, whose complex properties on execution paths can be automatically validated using stochastic model-checking techniques. In this paper, we propose a new theoretical model, named Path-Constrained Markov Decision Processes: it allows system designers to directly optimize safe policies in a single design pass, whose possible executions are guaranteed to satisfy some probabilistic constraints on their paths, expressed in Probabilistic Real Time Computation Tree Logic. We mathematically analyze properties of PC-MDPs and provide an iterative linear programming algorithm for solving them. We also present experiments that illustrate PC-MDPs and highlight their benefits.

机译：马尔可夫决策过程（MDPS）是一个流行的规划模式，用于根据概率不确定性规划。 MDP的解决方案是表示为受控马尔可夫链的策略，其可以使用随机模型检查技术自动验证执行路径上的复杂性质。在本文中，我们提出了一种新的理论模型，命名为路径约束的马尔可夫决策过程：它允许系统设计人员直接在单个设计通过中优化安全策略，其可能的执行是为了满足其路径上的某些概率约束，表达了概率实时计算树逻辑。我们在数学上分析PC-MDP的特性，并提供一种迭代线性编程算法来解决它们。我们还提出了说明PC-MDP的实验，并突出了它们的好处。

著录项

来源
《European Conference on Artificial Intelligence》|2012年||共6页
会议地点
作者
Florent Teichteil-Koenigsbuch;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Elaboration Tolerant Representation of Markov Decision Process via Decision-Theoretic Extension of Probabilistic Action Language Pbc+ [J] . Wang Yi, Lee Joohyung Theory and Practice of Logic Programming . 2021,第3期

机译：通过决策 - 理论延伸的概率动作语言PBC +阐述马尔可夫决策过程的详细说明
2. Decision-Theoretic Planning with non-Markovian Rewards [J] . Gretton C., Kabanza F., Price D., The Journal of Artificial Intelligence Research . 2006,第12期

机译：具有非马尔可夫奖赏的决策理论规划
3. Decision-Theoretic Planning with non-Markovian Rewards [J] . S. Thiebaux, C. Gretton, J. Slaney, Journal of Automation, Mobile Robotics & Intelligent Systems . 2006,第5期

机译：具有非马尔可夫奖赏的决策理论规划
4. Path-Constrained Markov Decision Processes: bridging the gap between probabilistic model-checking and decision-theoretic planning [C] . Florent Teichteil-Koenigsbuch 20th European conference on artificial intelligence . 2012

机译：路径受限的马尔可夫决策过程：弥合概率模型检查与决策理论计划之间的差距
5. Conditional probabilistic logic programming for probability model construction with application to decision-theoretic planning. [D] . Ngo, Liem Huu. 1997

机译：用于概率模型构建的条件概率逻辑编程，并应用于决策理论规划。
6. Decision-theoretic refinement planning: a new method for clinical decision analysis. [O] . A. Doan, P. Haddawy, C. E. Kahn Jr 1995

机译：决策理论优化计划：一种用于临床决策分析的新方法。
7. Unifying Nondeterministic and Probabilistic Planning Through Imprecise Markov Decision Processes [O] . Felipe W. Trevizan, Fábio G. Cozman, Leliane N. De Barros 2016

机译：通过不精确的马尔可夫决策过程统一不确定性和概率规划
8. Monte Carlo Simulation of Markov, Semi-Markov, and Generalized Semi- Markov Processes in Probabilistic Risk Assessment [R] . English, Thomas 2005

机译：概率风险评估中马尔可夫，半马尔可夫和广义半马尔可夫过程的monte Carlo模拟

Path-Constrained Markov Decision Processes: bridging the gap between probabilistic model-checking and decision-theoretic planning

摘要

著录项

相似文献

相关主题

期刊订阅