首页> 外文期刊>IEEE Transactions on Automatic Control >Robust Optimality for Discounted Infinite-Horizon Markov Decision Processes With Uncertain Transition Matrices
【24h】

Robust Optimality for Discounted Infinite-Horizon Markov Decision Processes With Uncertain Transition Matrices

机译:不确定转移矩阵的无穷折扣马尔可夫决策过程的鲁棒最优性

获取原文
获取原文并翻译 | 示例

摘要

We study finite-state, finite-action, discounted infinite-horizon Markov decision processes with uncertain transition matrices in the deterministic policy space. The transition matrices are classified as either independent or correlated. A generalized robust optimality criterion which can be degenerated to some popular optimality criteria is proposed, under which an optimal or near-optimal policy exists for any uncertain transition matrix. Theorems are developed to guarantee a stationary policy being optimal or near-optimal in the deterministic policy space.
机译:我们研究了在确定性政策空间中具有不确定转移矩阵的有限状态,有限作用,折现无限水平马尔可夫决策过程。过渡矩阵分为独立的或相关的。提出了可以退化为一些流行的最优准则的广义鲁棒最优准则,在该准则下,对于任何不确定的转移矩阵,存在最优或接近最优的策略。建立定理以确保固定策略在确定性策略空间中是最佳或接近最优的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号