Conference: IEEE/RSJ International Conference on Intelligent Robots and Systems

Solving Markov Decision Processes with Reachability Characterization from Mean First Passage Times

Abstract

This paper proposes a new mechanism for efficiently solving Markov decision processes (MDPs). We introduce the notion of a reachability landscape, in which the Mean First Passage Time (MFPT) characterizes the reachability of every state in the state space. We show that this reachability characterization assesses the importance of states well and thus provides a natural basis for prioritizing states and approximating policies. Building on this observation, we design two new algorithms, Mean First Passage Time based Value Iteration (MFPT-VI) and Mean First Passage Time based Policy Iteration (MFPT-PI), both modified from state-of-the-art solution methods. To validate the design, we performed numerical evaluations in robotic decision-making scenarios, comparing the proposed methods with the corresponding classic baseline mechanisms. The results show that MFPT-VI and MFPT-PI outperform the state-of-the-art solutions in both practical runtime and number of iterations. Beyond fast convergence, the new solution method is intuitive to understand and simple to implement.
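The abstract does not spell out how MFPT drives the state ordering, so the following is only a minimal sketch of one plausible reading, not the authors' MFPT-VI. It computes mean first passage times to a goal state for the Markov chain induced by a fixed policy, then uses ascending MFPT as the backup order in Gauss-Seidel value iteration. The five-state corridor, the 0.8/0.2 transition noise, the reward of -1 per step, the discount factor, and all function names are assumptions introduced for illustration.

```python
def mean_first_passage_times(P, goal, tol=1e-10, max_iters=100_000):
    """Expected number of steps to first reach `goal` from each state.

    P is the row-stochastic transition matrix of a Markov chain.
    Solves m[goal] = 0 and m[s] = 1 + sum_t P[s][t] * m[t] for s != goal
    with Gauss-Seidel sweeps. Assumes the goal is reachable from every state.
    """
    n = len(P)
    m = [0.0] * n
    for _ in range(max_iters):
        delta = 0.0
        for s in range(n):
            if s == goal:
                continue  # passage time from the goal to itself stays 0
            new = 1.0 + sum(P[s][t] * m[t] for t in range(n))
            delta = max(delta, abs(new - m[s]))
            m[s] = new
        if delta < tol:
            break
    return m


def prioritized_value_iteration(T, R, gamma, order, tol=1e-8, max_sweeps=10_000):
    """Gauss-Seidel value iteration that backs up states in `order`.

    T[a][s][t] is a transition probability and R[s] a state reward.
    Returns the value function and the number of sweeps performed.
    """
    n = len(R)
    V = [0.0] * n
    for sweep in range(1, max_sweeps + 1):
        delta = 0.0
        for s in order:
            best = max(
                R[s] + gamma * sum(T[a][s][t] * V[t] for t in range(n))
                for a in range(len(T))
            )
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            return V, sweep
    return V, max_sweeps


# Hypothetical toy problem: a 5-state corridor with an absorbing goal at state 4.
N, GOAL, GAMMA = 5, 4, 0.95


def corridor(step):
    """Transition matrix for moving by `step`; the move succeeds w.p. 0.8."""
    P = [[0.0] * N for _ in range(N)]
    for s in range(N):
        if s == GOAL:
            P[s][s] = 1.0
            continue
        t = min(max(s + step, 0), N - 1)
        P[s][t] += 0.8
        P[s][s] += 0.2
    return P


T = [corridor(-1), corridor(+1)]      # action 0 = left, action 1 = right
R = [-1.0] * N
R[GOAL] = 0.0

# Assumption: reachability is measured under the "always move right" policy.
mfpt = mean_first_passage_times(T[1], GOAL)
order = sorted(range(N), key=lambda s: mfpt[s])   # most reachable states first

V_mfpt, sweeps_mfpt = prioritized_value_iteration(T, R, GAMMA, order)
V_rev, sweeps_rev = prioritized_value_iteration(T, R, GAMMA, list(reversed(order)))
print("MFPT:", mfpt)
print("sweeps (MFPT order, reversed order):", sweeps_mfpt, sweeps_rev)
```

Both orderings converge to the same value function; only the number of sweeps can differ, which is the quantity to compare when experimenting with other orderings or larger state spaces.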
