首页> 外文会议>International joint conference on artificial intelligence >Robust Optimization for Hybrid MDPs with State-Dependent Noise
【24h】

Robust Optimization for Hybrid MDPs with State-Dependent Noise

机译:具有状态相关噪声的混合MDP的鲁棒优化

获取原文

摘要

Recent advances in solutions to Hybrid MDPs with discrete and continuous state and action spaces have significantly extended the class of MDPs for which exact solutions can be derived,albeit at the expense of a restricted transition noise model.In this paper,we work around limitations of previous solutions by adopting a robust optimization approach in which Nature is allowed to adversarially determine transition noise within pre-specified confidence intervals.This allows one to derive an optimal policy with an arbitrary (user-specified) level of success probability and significantly extends the class of transition noise models for which Hybrid MDPs can be solved.This work also significantly extends results for the related “chance-constrained” approach in stochastic hybrid control to accommodate state-dependent noise.We demonstrate our approach working on a variety of hybrid MDPs taken from AI planning,operations research,and control theory,noting that this is the first time robust solutions with strong guarantees over all states have been automatically derived for such problems.
机译:具有离散状态和连续状态以及动作空间的混合MDP解决方案的最新进展显着扩展了MDP的类别,尽管可以牺牲受限的过渡噪声模型,但仍可以得出精确的解决方案。通过采用鲁棒的优化方法的先前解决方案,其中允许Nature在预定的置信区间内以对抗性的方式确定过渡噪声,从而可以得出具有任意(用户指定)成功概率级别的最优策略,并显着扩展了类别可以解决混合MDP的过渡噪声模型。这项工作还显着扩展了随机混合控制中相关的“机会约束”方法的结果,以适应与状态相关的噪声。我们证明了我们的方法适用于各种混合MDP来自AI计划,运筹学和控制理论的资料,这是第一次强大的解决方案对于此类问题,已经自动得出了对所有州都具有有力保证的声明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号