...
首页> 外文期刊>Theoretical computer science >Optimal schedulers vs optimal bases: An approach for efficient exact solving of Markov decision processes
【24h】

Optimal schedulers vs optimal bases: An approach for efficient exact solving of Markov decision processes

机译:最佳计划程序与最佳基准:有效精确求解马尔可夫决策过程的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Quantitative model checkers for Markov decision processes typically use finite-precision arithmetic. If all the coefficients in the process are rational numbers, then the model checking results are rational, and so they can be computed exactly. However, exact techniques are generally too expensive or limited in scalability. In this paper we propose a method for obtaining exact results starting from an approximated solution in finite-precision arithmetic. The input of the method is a description of a scheduler, which can be obtained by a model checker using finite precision. Given a scheduler, we show how to obtain a corresponding basis in a linear programming problem, in such a way that the basis is optimal whenever the scheduler attains the worst-case probability. This correspondence is already known for discounted MDPs, we show how to apply it in the undiscounted case provided that some preprocessing is done. Using the correspondence, the linear programming problem can be solved in exact arithmetic starting from the basis obtained. As a consequence, the method finds the worst-case probability even if the scheduler provided by the model checker was not optimal. In our experiments, the calculation of exact solutions from a candidate scheduler is significantly faster than the calculation using the simplex method under exact arithmetic starting from a default basis.
机译:Markov决策过程的定量模型检查器通常使用有限精度算法。如果过程中所有系数均为有理数,则模型检验结果为有理数,因此可以精确计算它们。但是,精确的技术通常太昂贵或可扩展性受到限制。在本文中,我们提出了一种从有限精度算术中的近似解开始获取精确结果的方法。该方法的输入是对调度程序的描述,可以通过模型检查器使用有限精度来获得该​​调度程序。给定一个调度程序,我们将说明如何在线性规划问题中获得相应的基础,使得只要调度程序达到最坏情况的概率,基础都是最优的。这种对应关系对于折价的MDP来说是众所周知的,我们展示了如何在不打折的情况下将其应用,前提是需要进行一些预处理。使用该对应关系,可以从获得的基础开始,以精确的算术解决线性规划问题。结果,即使模型检查器提供的调度程序不是最佳的,该方法也能找到最坏情况的概率。在我们的实验中,使用候选调度程序的精确解的计算要比使用单纯形法从默认基础开始的精确算术的计算快得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号