首页> 美国政府科技报告 >Blackwell Optimality in the Class of All Policies in Markov Decision Chains witha Borel State Space and Unbounded Rewards
【24h】

Blackwell Optimality in the Class of All Policies in Markov Decision Chains witha Borel State Space and Unbounded Rewards

机译:具有Borel状态空间和无界奖励的马尔可夫决策链中所有策略类的Blackwell最优性

获取原文

摘要

This paper is the second part of the authors' study of Blackwell optimal policiesin Markov decision chains with a Borel state space and unbounded rewards. The authors prove that a stationary policy is Blackwell optimal in the class of all history-dependent policies if it is Blackwell optimal in the class of stationary policies. The authors also develop recurrence and drift conditions which ensure ergodicity and intergrability assumptions made in the previous paper, and which are more suitable for applications. As an example the authors study a cash-balance model.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号