Stochastic Over-subscription Planning using Hierarchies of MDPs

机译：使用MDP层次结构的随机超额预订计划

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In over-subscription planning (OSP), the set of goals is not achievable jointly, and the task is to find a plan that attains the best feasible subset of goals given resource constraints. Recent classical OSP algorithms ignore the uncertainty inherent in many natural application domains where OSPs arise. And while modeling stochastic OSP problems as MDPs is easy, the resulting models are too large for standard solution approaches. Fortunately OSP problems have a natural two-tiered hierarchy, and in this paper we adapt and extend tools developed in the hierarchical reinforcement learning community in order to effectively exploit this hierarchy and obtain compact, factored policies. Typically, such policies are sub-optimal, but under certain assumptions that hold in our planetary exploration domain, our factored solution is, in fact, optimal. Our algorithms work by repeatedly solving a number of smaller MDPs, while propagating information between them. We evaluate a number of variants of this approach on a set of stochastic instances of a planetary rover domain, showing substantial performance gains.

机译：在超额预订计划（OSP）中，不能共同实现一组目标，而任务是找到一个在资源有限的情况下达到最佳可行目标子集的计划。最近的经典OSP算法忽略了出现OSP的许多自然应用领域固有的不确定性。尽管将随机OSP问题建模为MDP很容易，但对于标准解决方案方法而言，所得模型太大。幸运的是，OSP问题具有自然的两级层次结构，在本文中，我们适应并扩展了在层次强化学习社区中开发的工具，以便有效利用此层次结构并获得紧凑，分解的策略。通常，此类政策不是最优的，但在我们的行星勘探领域中采用的某些假设下，事实上，我们的分解式解决方案是最佳的。我们的算法通过反复求解许多较小的MDP，同时在它们之间传播信息来工作。我们在行星漫游车域的一组随机实例上评估了这种方法的多种变体，显示出可观的性能提升。

著录项

来源
《International Conference on Automated Planning and Scheduling(ICAPS 2006); 2006;》|2006年|P.121-130|共10页
会议地点
作者
Nicolas Meuleau; Ronen Brafman; Emmanuel Benazera;
展开▼
作者单位

NASA Ames Research Center, Mail Stop 269-3 Moffet Field, CA 94035-1000;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 N12;
关键词
入库时间 2022-08-26 14:15:26

相似文献

外文文献
中文文献
专利

1. Planning to see: A hierarchical approach to planning visual actions on a robot using POMDPs [J] . Mohan Sridharan, Jeremy Wyatt, Richard Dearden Artificial intelligence . 2010,第11期

机译：计划观看：使用POMDP计划机器人视觉动作的分层方法
2. A hybrid POMDP-BDI agent architecture with online stochastic planning and plan caching [J] . Rens Gavin, Moodley Deshendran Cognitive Systems Research . 2017,第JUNa期

机译：具有在线随机计划和计划缓存的混合POMDP-BDI代理架构
3. Active Visual Planning for Mobile Robot Teams Using Hierarchical POMDPs [J] . Zhang, S., Sridharan, IEEE Transactions on Robotics . 2013,第4期

机译：使用分层POMDP对移动机器人团队进行主动视觉规划
4. Probabilistic Hierarchical Planning over MDPs(Extended Abstract) [C] . Yuqing Tang, Felipe Meneguzzi, Katia Sycara, International conference on autonomous agents and multiagent systems;AAMAS 2011 . 2011

机译：MDP上的概率层次规划（扩展摘要）
5. Planning under uncertainty: From informative path planning to partially observable semi-MDPs [D] . Wei, Lim Zhan. 2015

机译：不确定性下的计划：从信息路径计划到部分可观察的半MDP
6. Context/Resource-Aware Mission Planning Based on BNs and Concurrent MDPs for Autonomous UAVs [O] . Chabha Hireche, Catherine Dezan, Stéphane Mocanu, 2018

机译：基于BN和并发MDP的自主无人机的上下文/资源感知任务计划
7. Planning to see: A hierarchical approach to planning visual actions on a robot using POMDPs [O] . Sridharan Mohan, Wyatt Jeremy, Dearden Richard 2010

机译：计划观看：使用POMDP计划机器人视觉动作的分层方法

Stochastic Over-subscription Planning using Hierarchies of MDPs

摘要

著录项

相似文献

相关主题

期刊订阅