首页> 外文会议>International Joint Conference on Artificial Intelligence >I-dual: Solving Constrained SSPs via Heuristic Search in the Dual Space
【24h】

I-dual: Solving Constrained SSPs via Heuristic Search in the Dual Space

机译:i-Dual:通过双层空间中通过启发式搜索解决受约束的SSP

获取原文

摘要

We consider the problem of generating optimal stochastic policies for Constrained Stochastic Shortest Path problems, which are a natural model for planning under uncertainty for resourcebounded agents with multiple competing objectives. While unconstrained SSPs enjoy a multitude of efficient heuristic search solution methods with the ability to focus on promising areas reachable from the initial state, the state of the art for constrained SSPs revolves around linear and dynamic programming algorithms which explore the entire state space. In this paper, we present i-dual, the first heuristic search algorithm for constrained SSPs. To concisely represent constraints and efficiently decide their violation, i-dual operates in the space of dual variables describing the policy occupation measures. It does so while retaining the ability to use standard value function heuristics computed by well-known methods. Our experiments show that these features enable i-dual to achieve up to two orders of magnitude improvement in run-time and memory over linear programming algorithms.
机译:我们考虑产生限制随机最短路径问题的最佳随机策略的问题,这是一个具有多种竞争目标的资源截障代理的不确定性下规划的自然模型。虽然未受控SSP享有多种高效的启发式搜索解决方案方法,其能够专注于从初始状态到达的有希望的区域,但是用于约束的SSP的最先进的SSPS围绕探索整个状态空间的线性和动态编程算法。在本文中,我们呈现I-Dual,第一个启发式搜索算法为约束的SSPS。简明扼要地表示限制,有效地决定其违规,I-Dual在描述政策占用措施的双变量空间中运行。它确实如此保留使用众所周知的方法计算的标准值函数启发式的能力。我们的实验表明,这些功能使I-Dual能够通过线性编程算法在运行时和内存中实现最多两个数量级改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号