首页> 外文期刊>Automatic Control, IEEE Transactions on >The Optimal Observability of Partially Observable Markov Decision Processes: Discrete State Space
【24h】

The Optimal Observability of Partially Observable Markov Decision Processes: Discrete State Space

机译:部分可观马尔可夫决策过程的最优可观性:离散状态空间

获取原文
获取原文并翻译 | 示例
       

摘要

We consider autonomous partially observable Markov decision processes where the control action influences the observation process only. Considering entropy as the cost incurred by the Markov information state process, the optimal observability problem is posed as a Markov decision scheduling problem that minimizes the infinite horizon cost. This scheduling problem is shown to be equivalent to minimization of an entropy measure, called estimation entropy which is related to the invariant measure of the information state.
机译:我们考虑自主的可部分观测的马尔可夫决策过程,其中控制行为仅影响观测过程。将熵视为由马尔可夫信息状态过程引起的成本,将最佳可观察性问题提出为使无限远景成本最小化的马尔可夫决策调度问题。该调度问题显示为等效于熵测度的最小化,该熵测度被称为估计熵,其与信息状态的不变测度有关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号