首页> 美国政府科技报告 >Optimality of Stationary Halting Policies and Finite Termination of Successive Approximations.
【24h】

Optimality of Stationary Halting Policies and Finite Termination of Successive Approximations.

机译:平稳停滞策略的最优性与逐次逼近的有限终止。

获取原文

摘要

The stopping and halting optimality of stationary halting policies in discrete-time-parameter S-state finite-action branching Markov decision chains are characterized by the finite termination of successive approximations. A policy is called halting (resp., stopping) if the expected population size at time N is zero for some N (resp., converges to zero as N approaches infinity). The value of a policy is the expected infinite-horizon income that it earns. An optimal stopping (resp., halting) policy is one having maximum value in that class of policies. It is shown that when the rewards are real (resp., real or minus infinity) valued, the N-th iterate of successive approximations (and a Gauss-Seidel improvement thereof) is a fixed point of the optimal return operator for some N when initiated with the value of a stationary halting policy if and only if that is so for some N < or = S; moreover this occurs if and only if there exists a halting stationary optimal stopping (resp., halting) policy.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号