Stochastic Dynamic Programming with Range and Ratio Criteria

机译：具有范围和比率标准的随机动态规划

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we consider a finite-stage stochastic dynamic programming with range and ratio criteria. The range criterion is the maximum reward over the total stage minus minimum reward. As a ratio criterion, we take the ratio of one additive reward to the other. Our optimization problem is to minimize the expected value of the range and the ratio over a large class of policies. For each criterion of range and ratio, we use an invariant imbedding method, which introduces a family of past-value sets for reward accumulation. The imbedding expands the original state space by two dimension. First, we derive a forward recursive equation for the sequence of past-value sets. Second, we derive a backward recursive formula for sequence of optimal value functions on augmented state spaces. Finally a numerical example is illustrated for a two-state, two-action and two-stage model.

机译：在本文中，我们考虑具有范围和比率标准的有限阶段随机动态规划。范围标准是整个阶段的最大奖励减去最小奖励。作为比率标准，我们采用一种附加奖励与另一种附加奖励的比率。我们的优化问题是在大型策略中将范围的期望值和比率最小化。对于范围和比率的每个标准，我们使用不变嵌入方法，该方法引入了一组过去值集以进行奖励累积。嵌入将原始状态空间扩展了二维。首先，我们推导过去值集序列的正向递归方程。其次，我们为增强状态空间上的最优函数序列推导了一个反向递归公式。最后，给出了两个状态，两个动作和两个阶段的模型的数值示例。

著录项

来源
《The Ninth Bellman Continuum International Workshop on Uncertain Systems and Soft Computing Jul 24-27, 2002 Beijing, China》|2002年|p.22-28|共7页
会议地点 Beijing(CN)
作者
Kazuyoshi Tsurusaki; Takayuki Ueno; Seiichi Iwamoto;
展开▼
作者单位

Department of Economics, Faculty of Economics, Nagasaki University, Nagasaki 850-8506, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 C9;
关键词
range; ratio; compound criterion; stochastic optimization; expanded markov policy; past-value sets; state expansion; dynamic programming; invariant imbedding;

机译：范围;比率;复合准则;随机优化;扩展马尔可夫策略;过去值集;状态扩展;动态规划;不变嵌入;

相似文献

外文文献
中文文献
专利

1. Sampling strategies and stopping criteria for stochastic dual dynamic programming: a case study in long-term hydrothermal scheduling [J] . Tito Homem-de-Mello, Vitor L. de Matos, Erlon C. Finardi Energy systems . 2011,第1期

机译：随机双重动态规划的采样策略和停止准则：长期热液调度的案例研究
2. Stochastic generation expansion planning by means of stochastic dynamic programming [J] . Mo B., Hegge J. IEEE Transactions on Power Systems . 1991,第2期

机译：随机动态规划的随机发电扩展计划
3. A stochastic dynamic programming approach to decision making in arranged marriages [J] . Batabyal A.A., Beladi H. Applied mathematics letters . 2011,第12期

机译：安排婚姻中的决策的随机动态规划方法
4. Stochastic Dynamics Programming with Range and Ratio Criteria [C] . Kazuyoshi Tsurusaki, Takayuki Ueno, Seiichi Iwamoto Bellman continuum International workshop on uncertain systems and soft computing . 2002

机译：具有范围和比率标准的随机动力学编程
5. Stochastic Dual Dynamic Programming and Backward Approximate Dynamic Programming with Integrated Crossing State Stochastic Models for Wind Power in Energy Storage Optimization [D] . Durante, Joseph L. 2020

机译：随机双动规范和倒退近似动态规划，具有集成交叉状态随机模型的蓄能优化
6. Population dynamics demographic stochasticity and the evolution of cooperation [O] . Michael Doebeli, Albert Blarer, Martin Ackermann 1997

机译：人口动态人口随机性和合作的演变
7. A stochastic dynamic programming approach to decision making in arranged marriages [O] . Batabyal Amitrajeet A., Beladi Hamid 2011

机译：安排婚姻中决策的随机动态规划方法

Stochastic Dynamic Programming with Range and Ratio Criteria

摘要

著录项

相似文献

相关主题

期刊订阅