首页> 外文会议>European Conference on Artificial Intelligence >Path-Constrained Markov Decision Processes: bridging the gap between probabilistic model-checking and decision-theoretic planning
【24h】

Path-Constrained Markov Decision Processes: bridging the gap between probabilistic model-checking and decision-theoretic planning

机译:路径约束的马尔可夫决策过程:弥合概率模型 - 检查与决策定制规划之间的差距

获取原文

摘要

Markov Decision Processes (MDPs) are a popular model for planning under probabilistic uncertainties. The solution of an MDP is a policy represented as a controlled Markov chain, whose complex properties on execution paths can be automatically validated using stochastic model-checking techniques. In this paper, we propose a new theoretical model, named Path-Constrained Markov Decision Processes: it allows system designers to directly optimize safe policies in a single design pass, whose possible executions are guaranteed to satisfy some probabilistic constraints on their paths, expressed in Probabilistic Real Time Computation Tree Logic. We mathematically analyze properties of PC-MDPs and provide an iterative linear programming algorithm for solving them. We also present experiments that illustrate PC-MDPs and highlight their benefits.
机译:马尔可夫决策过程(MDPS)是一个流行的规划模式,用于根据概率不确定性规划。 MDP的解决方案是表示为受控马尔可夫链的策略,其可以使用随机模型检查技术自动验证执行路径上的复杂性质。在本文中,我们提出了一种新的理论模型,命名为路径约束的马尔可夫决策过程:它允许系统设计人员直接在单个设计通过中优化安全策略,其可能的执行是为了满足其路径上的某些概率约束,表达了概率实时计算树逻辑。我们在数学上分析PC-MDP的特性,并提供一种迭代线性编程算法来解决它们。我们还提出了说明PC-MDP的实验,并突出了它们的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号