首页> 外文会议>International Conference on Automated Planning and Scheduling >Prioritizing Bellman Backups Without a Priority Queue
【24h】

Prioritizing Bellman Backups Without a Priority Queue

机译:在没有优先级队列的情况下优先考虑Bellman备份

获取原文

摘要

Several researchers have shown that the efficiency of value iteration, a dynamic programming algorithm for Markov decision processes, can be improved by prioritizing the order of Bellman backups to focus computation on states where the value function can be improved the most. In previous work, a priority queue has been used to order backups. Although this incurs overhead for maintaining the priority queue, previous work has argued that the overhead is usually much less than the benefit from prioritization. However this conclusion is usually based on a comparison to a non-prioritized approach that performs Bellman backups on states in an arbitrary order. In this paper, we show that the overhead for maintaining the priority queue can be greater than the benefit, when it is compared to very simple heuristics for prioritizing backups that do not require a priority queue. Although the order of backups induced by our simple approach is often sub-optimal, we show that its smaller overhead allows it to converge faster than other state-of-the-art priority-based solvers.
机译:几个研究人员已经表明,通过优先考虑贝尔曼备份顺序来对焦可以提高价值函数的状态,可以提高价值迭代的动态编程算法,可以提高Markov决策过程的动态编程算法。在以前的工作中,优先级队列已被用于订购备份。虽然这种招收了维持优先队列的开销,但之前的工作据称,开销通常远低于优先级的好处。然而,这一结论通常基于与非优先顺序方法的比较,这些方法以任意顺序执行贝尔曼备份。在本文中,我们表明,维护优先级队列的开销可以大于益处,当它与非常简单的启发式进行比较,以优先考虑不需要优先级队列的备份。虽然我们的简单方法引起的备份顺序通常是次优,但我们表明其较小的开销允许其比其他最先进的优先级的求解器更快地收敛。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号