首页> 中文期刊> 《系统科学与复杂性:英文版》 >A Fast Approximation Method for Partially Observable Markov Decision Processes

A Fast Approximation Method for Partially Observable Markov Decision Processes

         

摘要

This paper develops a new lower bound method for POMDPs that approximates the update of a belief by the update of its non-zero states.It uses the underlying MDP to explore the optimal reachable state space from initial belief and select actions during value iterations,which significantly accelerates the convergence speed.Also,an algorithm which collects and prunes belief points based on the upper and lower bounds is presented,and experimental results show that it outperforms some of the state-of-art point-based algorithms.

著录项

  • 来源
    《系统科学与复杂性:英文版》 |2018年第6期|1423-1436|共14页
  • 作者单位

    Department of Automation;

    University of Science and Technology of China;

    Hefei 230027;

    China;

    Department of Automation;

    University of Science and Technology of China;

    Hefei 230027;

    China;

    Department of Automation;

    University of Science and Technology of China;

    Hefei 230027;

    China;

    Department of Automation;

    University of Science and Technology of China;

    Hefei 230027;

    China;

  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号