An Argument for the Bayesian Control of Partially Observable Markov Decision Processes

Vargo E.; Cogill R.

首页> 外文期刊>Automatic Control, IEEE Transactions on >An Argument for the Bayesian Control of Partially Observable Markov Decision Processes

【24h】

An Argument for the Bayesian Control of Partially Observable Markov Decision Processes

机译：部分可观察的马尔可夫决策过程的贝叶斯控制的一个争论

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This technical note concerns the control of partially observable Markov decision processes characterized by a prior distribution over the underlying hidden Markov model parameters. In such instances, the control problem is commonly simplified by first choosing a point estimate from the model prior, and then selecting the control policy that is optimal with respect to the point estimate. Our contribution is to demonstrate, through a tractable yet nontrivial example, that even the best control policies constructed in this manner can significantly underperform the Bayes optimal policy. While this is an operative assumption in the Bayes-adaptive Markov decision process literature, to our knowledge no such illustrative example has been formally proposed.

机译：本技术说明涉及对部分可观察到的马尔可夫决策过程的控制，该过程的特征是对基础隐马尔可夫模型参数进行先验分布。在这种情况下，通常通过先从模型先选择一个点估计，然后选择相对于该点估计最佳的控制策略来简化控制问题。我们的贡献是通过一个易于处理但又不平凡的例子来证明，即使以这种方式构造的最佳控制策略也可能大大落后于贝叶斯最佳策略。尽管这是适用于贝叶斯的马尔可夫决策过程文献中的一个有效假设，但据我们所知，尚未正式提出这样的说明性例子。

著录项

来源
《Automatic Control, IEEE Transactions on》 |2014年第10期|2796-2800|共5页
作者
Vargo E.; Cogill R.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Adaptation models; Bayes methods; Computational modeling; Hidden Markov models; Markov processes; Standards; Uncertainty; Adaptive control; Markov processes; stochastic optimal control; uncertain systems;

机译：适应模型;贝叶斯方法;计算建模;隐藏的马尔可夫模型;马尔可夫过程;标准;不确定;自适应控制马尔可夫过程;随机最优控制;不确定的系统;

相似文献

外文文献
中文文献
专利

1. A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes [J] . Ross St??phane, Pineau Joelle, Chaib-draa Brahim, Journal of machine learning research . 2011,第May期

机译：在局部可观的马尔可夫决策过程中进行学习和计划的贝叶斯方法
2. Stochastic Predictive Control for Partially Observable Markov Decision Processes With Time-Joint Chance Constraints and Application to Autonomous Vehicle Control [J] . Li Nan, Girard Anouck, Kolmanovsky Ilya Journal of Dynamic Systems, Measurement, and Control . 2019,第7期

机译：随机预测控制对部分观察到的马尔可夫决策过程，时间关节机会限制和应用于自主车辆控制
3. Decentralized control of multi-robot partially observable Markov decision processes using belief space macro-actions [J] . Shayegan Omidshafiei, Ali-Akbar Agha-Mohammadi, Christopher Amato, The International journal of robotics research . 2017,第2期

机译：使用置信空间宏作用的多机器人部分可观察的马尔可夫决策过程的分散控制
4. Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes [C] . Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Minami, 6th workshop on ontologies and lexical resources. . 2010

机译：使用部分可观察的马尔可夫决策过程控制面向听力的对话
5. Autonomous UAV Control and Testing Methods Utilizing Partially Observable Markov Decision Processes [D] . Eaton, Christopher M. 2018

机译：利用部分可观察的马尔可夫决策过程的自主无人机控制和测试方法
6. Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes [O] . Rajesh P. N. Rao 2010

机译：不确定性下的决策：基于部分可观察的马尔可夫决策过程的神经模型
7. Stochastic Optimization of Controlled Partially Observable Markov Decision Processes [O] . Peter L. Bartlett, Jonathan Baxter 100

机译：受控部分可观察的马尔可夫决策过程的随机优化

An Argument for the Bayesian Control of Partially Observable Markov Decision Processes

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅