首页> 外文期刊>Automatic Control, IEEE Transactions on >An Argument for the Bayesian Control of Partially Observable Markov Decision Processes
【24h】

An Argument for the Bayesian Control of Partially Observable Markov Decision Processes

机译:部分可观察的马尔可夫决策过程的贝叶斯控制的一个争论

获取原文
获取原文并翻译 | 示例
       

摘要

This technical note concerns the control of partially observable Markov decision processes characterized by a prior distribution over the underlying hidden Markov model parameters. In such instances, the control problem is commonly simplified by first choosing a point estimate from the model prior, and then selecting the control policy that is optimal with respect to the point estimate. Our contribution is to demonstrate, through a tractable yet nontrivial example, that even the best control policies constructed in this manner can significantly underperform the Bayes optimal policy. While this is an operative assumption in the Bayes-adaptive Markov decision process literature, to our knowledge no such illustrative example has been formally proposed.
机译:本技术说明涉及对部分可观察到的马尔可夫决策过程的控制,该过程的特征是对基础隐马尔可夫模型参数进行先验分布。在这种情况下,通常通过先从模型先选择一个点估计,然后选择相对于该点估计最佳的控制策略来简化控制问题。我们的贡献是通过一个易于处理但又不平凡的例子来证明,即使以这种方式构造的最佳控制策略也可能大大落后于贝叶斯最佳策略。尽管这是适用于贝叶斯的马尔可夫决策过程文献中的一个有效假设,但据我们所知,尚未正式提出这样的说明性例子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号