首页> 外文期刊>ACM Transactions on Modeling and Computer Simulation >Interval Markov Decision Processes with Multiple Objectives: From Robust Strategies to Pareto Curves
【24h】

Interval Markov Decision Processes with Multiple Objectives: From Robust Strategies to Pareto Curves

机译:具有多个目标的区间马尔可夫决策过程:从鲁棒策略到帕累托曲线

获取原文
获取原文并翻译 | 示例
           

摘要

Accurate Modelling of a real-world system with probabilistic behaviour is a difficult task. Sensor noise and statistical estimations, among other imprecisions, make the exact probability values impossible to obtain. In this article, we consider Interval Markov decision processes (IMDPs), which generalise classical MDPs by having interval-valued transition probabilities. They provide a powerful modelling tool for probabilistic systems with an additional variation or uncertainty that prevents the knowledge of the exact transition probabilities. We investigate the problem of robust multi-objective synthesis for IMDPs and Pareto curve analysis of multi-objective queries on IMDPs. We study how to find a robust (randomised) strategy that satisfies multiple objectives involving rewards, reachability, and more general omega-regular properties against all possible resolutions of the transition probability uncertainties, as well as to generate an approximate Pareto curve providing an explicit view of the trade-offs between multiple objectives. We show that the multi-objective synthesis problem is PSPACE-hard and provide a value iteration-based decision algorithm to approximate the Pareto set of achievable points. We finally demonstrate the practical effectiveness of our proposed approaches by applying them on several case studies using a prototype tool.
机译:对具有概率行为的真实系统进行精确建模是一项艰巨的任务。传感器噪声和统计估计以及其他不确定性使得无法获得确切的概率值。在本文中,我们考虑了区间马尔可夫决策过程(IMDP),该过程通过具有间隔值的转移概率来概括经典MDP。它们为概率系统提供了强大的建模工具,并带有额外的变化或不确定性,从而无法了解确切的转移概率。我们研究了IMDP的鲁棒多目标综合问题以及IMDP上多目标查询的帕累托曲线分析。我们研究了如何找到一种健壮的(随机的)策略,该策略能够满足涉及转换概率不确定性的所有可能解决方案的涉及奖励,可及性和更一般的欧米茄常规性质的多个目标,以及如何生成提供清晰视图的近似帕累托曲线多个目标之间的权衡取舍。我们证明了多目标综合问题是PSPACE难题,并提供了一种基于值迭代的决策算法来近似可实现点的Pareto集。最后,我们通过使用原型工具将其应用于几个案例研究中,论证了我们提出的方法的实际有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号