首页> 美国政府科技报告 >Optimality of Stationary Halting Policies and Finite Termination of Successive Approximations.

【24h】

Optimality of Stationary Halting Policies and Finite Termination of Successive Approximations.

机译：平稳停滞策略的最优性与逐次逼近的有限终止。

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The stopping and halting optimality of stationary halting policies in discrete-time-parameter S-state finite-action branching Markov decision chains are characterized by the finite termination of successive approximations. A policy is called halting (resp., stopping) if the expected population size at time N is zero for some N (resp., converges to zero as N approaches infinity). The value of a policy is the expected infinite-horizon income that it earns. An optimal stopping (resp., halting) policy is one having maximum value in that class of policies. It is shown that when the rewards are real (resp., real or minus infinity) valued, the N-th iterate of successive approximations (and a Gauss-Seidel improvement thereof) is a fixed point of the optimal return operator for some N when initiated with the value of a stationary halting policy if and only if that is so for some N < or = S; moreover this occurs if and only if there exists a halting stationary optimal stopping (resp., halting) policy.

著录项

作者
erickson, ranel e.;
展开▼
作者单位

展开▼
年度 1978
页码 1-27
总页数 27
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Decision theory; Stopping rules(Mathematics); Combinatorial analysis; Optimization; Markov processes; Convergence; Iterations; Stationary; Gauss-Seidel method; Markov decision processes;

机译：决策理论;停止规则（数学）;组合分析;优化;马尔可夫过程;收敛;迭代;固定;高斯 - 赛德尔方法;马尔可夫决策过程;

相似文献

外文文献
中文文献
专利

1. Existence of optimal stationary policies in finite dynamic programs with nonnegative rewards: an alternative approach [J] . Rolando Cavazos-Cadena, Raul Montes-de-Oca Probability in the Engineering and Informational Sciences . 2001,第4期

机译：具有非负奖励的有限动态程序中最优平稳策略的存在：一种替代方法
2. Optimal halting policies in Markov population decision chains with constant risk posture [J] . Pelin G. Canbolat Annals of Operations Research . 2014,第nova期

机译：具有恒定风险态势的马尔可夫种群决策链中的最优止损策略
3. Finite-time optimal control of polynomial systems using successive suboptimal approximations [J] . Xu X., Agrawal SK. Journal of Optimization Theory and Applications . 2000,第2期

机译：使用连续次优逼近的多项式系统有限时间最优控制
4. Complexity Analysis of Optimal Stationary Call Admission Policy and Fixed Set Partitioning Policy for OVSF-CDMA Cellular Systems [C] . Daniel Lee, Muhammad Naeem, Chingyu Hsu, Annual Canadian Conference on Electrical and Computer Engineering . 2007

机译：OVSF-CDMA蜂窝系统的最优静止呼叫录取政策和固定集分区策略的复杂性分析
5. Topology reconfiguration with successive approximations. [D] . Baskaran, Eswaran. 2007

机译：具有逐次逼近的拓扑重新配置。
6. Designing evaluation studies to optimally inform policy: what factors do policy-makers in China consider when making resource allocation decisions on healthcare worker training programmes? [O] . Shishi Wu, Helena Legido-Quigley, Julia Spencer, 2018

机译：设计评估研究以最佳地为政策提供信息：中国的决策者在制定医护人员培训计划的资源分配决策时会考虑哪些因素？
7. Optimal stationary policies inrisk-sensitive dynamic programs with finite state spaceand nonnegative rewards [O] . Rolando Cavazos-Cadena, Raúl Montes-de-Oca 2000

机译：具有有限状态空间和非负奖励的风险敏感动态程序的最佳固定政策

Optimality of Stationary Halting Policies and Finite Termination of Successive Approximations.

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅