A Policy Improvement Method in Constrained Stochastic Dynamic Programming

Chang H.S.

首页> 外文期刊>IEEE Transactions on Automatic Control >A Policy Improvement Method in Constrained Stochastic Dynamic Programming

【24h】

A Policy Improvement Method in Constrained Stochastic Dynamic Programming

机译：约束随机动态规划中的一种策略改进方法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This note presents a formal method of improving a given base-policy such that the performance of the resulting policy is no worse than that of the base-policy at all states in constrained stochastic dynamic programming. We consider finite horizon and discounted infinite horizon cases. The improvement method induces a policy iteration-type algorithm that converges to a local optimal policy.

机译：本说明介绍了一种改进给定基本策略的正式方法，以使在受限的随机动态规划中，所得策略的性能不比所有州的基本策略的性能差。我们考虑有限范围和折现无限范围的情况。该改进方法产生了收敛于局部最优策略的策略迭代型算法。

著录项

来源
《IEEE Transactions on Automatic Control》 |2006年第2006期|p.1523-1526|共4页
作者
Chang H.S.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化系统;
关键词
Constrained Markov decision process; dynamic programming; policy improvement; policy iteration; Constrained Markov decision process; dynamic programming; policy improvement; policy iteration;

机译：约束马尔可夫决策过程;动态规划;策略改进;策略迭代;约束马尔可夫决策过程;动态规划;策略改进;策略迭代;

相似文献

外文文献
中文文献
专利

1. A Policy Improvement Method in Constrained Stochastic Dynamic Programming [J] . Chang H.S. IEEE Transactions on Automatic Control . 2006,第9期

机译：约束随机动态规划中的一种策略改进方法
2. COMPUTING AVERAGE OPTIMAL CONSTRAINED POLICIES IN STOCHASTIC DYNAMIC PROGRAMMING [J] . Linn I. Sennott 20f Probability in the Engineering and Informational Sciences . 2001,第1期

机译：随机动态规划中的平均最优约束策略
3. A Dynamic Programming Policy Improvement Approach to the Development of Maintenance Policies for 2-Phase Systems With Aging [J] . MacPherson A. J., Glazebrook K. D. Reliability, IEEE Transactions on . 2011,第2期

机译：具有老化的两相系统维护策略开发的动态规划策略改进方法
4. Dynamic programming equations for constrained stochastic control [C] . Chen, R.C., Blankenship, . 2002

机译：约束随机控制的动态规划方程
5. Dynamic asset allocation by stochastic programming methods [D] . Collomb, Alexis 2005

机译：通过随机编程方法动态分配资产
6. Model reduction for stochastic CaMKII reaction kinetics in synapses by graph-constrained correlation dynamics [O] . Todd Johnson, Tom Bartol, Terrence Sejnowski, -1

机译：通过图约束相关动力学模型减少突触中随机CaMKII反应动力学的模型
7. New Solution Methods for Joint Chance-Constrained Stochastic Programs with Random Left-Hand Sides [O] . Tanner Matthew W. 2010

机译：带有随机左手边的联合机会约束随机程序的新求解方法
8. I,II Convergence and Rate of Convergence Theorems for Constrained and Unconstrained Stochastic Approximation,via Weak Convergence Methods. III Numerical Studies for Constrained Stochastic Approximation Problems, [R] . kushner,harold j. lakshmivarahan, s. 1977

机译：I，II收敛性和受约束和无约束随机逼近的收敛速度定理，通过弱收敛方法。 III约束随机逼近问题的数值研究，

A Policy Improvement Method in Constrained Stochastic Dynamic Programming

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅