首页> 外文期刊>Journal of Applied Research and Technology >Mixed Acceleration Techniques for Solving Quickly Stochastic Shortest-Path Markov Decision Processes
【24h】

Mixed Acceleration Techniques for Solving Quickly Stochastic Shortest-Path Markov Decision Processes

机译:快速随机最短路径马尔可夫决策过程的混合加速技术

获取原文
           

摘要

In this paper we propose the combination of accelerated variants of value iteration mixed with improved prioritized sweeping for the fast solution of stochastic shortest-path Markov decision processes. Value iteration is a classical algorithm for solving Markov decision processes, but this algorithm and its variants are quite slow for solving considerably large problems. In order to improve the solution time, acceleration techniques such as asynchronous updates, prioritization and prioritized sweeping have been explored in this paper. A topological reordering algorithm was also compared with static reordering. Experimental results obtained on finite state and action-space stochastic shortest-path problems show that our approach achieves a considerable reduction in the solution time with respect to the tested variants of value iteration. For instance, the experiments showed in one test a reduction of 5.7 times with respect to value iteration with asynchronous updates.
机译:在本文中,我们提出了将值迭代的加速变量与改进的优先扫描结合在一起的方法,用于随机最短路径马尔可夫决策过程的快速求解。值迭代是解决马尔可夫决策过程的经典算法,但是该算法及其变体在解决相当大的问题时非常慢。为了缩短求解时间,本文研究了异步更新,优先级划分和优先级清除等加速技术。拓扑重排序算法也与静态重排序进行了比较。在有限状态和动作空间随机最短路径问题上获得的实验结果表明,相对于经过测试的值迭代变体,我们的方法可显着减少求解时间。例如,实验在一项测试中显示,使用异步更新的值迭代减少了5.7倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号