
Minimization of Variance on Controlled Markov Chain



Abstract

We consider a variance criterion on a finite-stage controlled Markov chain. The variance is the sample variance of the sequence of stage rewards (random variables) on the chain. Through a nonconventional dynamic programming approach, we minimize the expected value of the variance over some large policy class. For computational simplicity, we take the variance multiplied by the square of the total number of stages. Our invariant imbedding method expands the original state space by one dimension. We illustrate a two-state, two-action, two-stage model by a stochastic decision tree-table method. The optimal solution is obtained by solving the backward recursive equation for the minimum value functions on the expanded state spaces.
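The backward recursion described above can be sketched in code. For n stages, n² times the sample variance equals n·Σr² − (Σr)² pathwise, so its expectation is minimized by a dynamic program on a state space expanded by one dimension: the accumulated reward sum s. Below is a minimal sketch of that recursion on a hypothetical two-state, two-action, two-stage model; the reward and transition data are invented for illustration and are not taken from the paper.

```python
from functools import lru_cache

N = 2                      # number of stages
STATES = (0, 1)
ACTIONS = (0, 1)

def reward(x, a):
    """Stage reward r(x, a) -- hypothetical toy data."""
    return 1.0 if a == 0 else float(2 * x)

def trans(x, a):
    """Transition probabilities p(x' | x, a) as a dict -- hypothetical toy data."""
    if a == 0:
        return {x: 1.0}        # action 0: stay put
    return {0: 0.5, 1: 0.5}    # action 1: jump uniformly

@lru_cache(maxsize=None)
def V(t, x, s):
    """Minimum expected value of N*sum(r^2) - (sum r)^2 accumulated from
    stage t onward, given current state x and reward sum s so far.
    The pair (x, s) is the expanded state of the invariant imbedding."""
    if t == N:
        return -s * s          # terminal term: subtract (total reward)^2
    best = float("inf")
    for a in ACTIONS:
        r = reward(x, a)
        val = N * r * r + sum(p * V(t + 1, x2, s + r)
                              for x2, p in trans(x, a).items())
        best = min(best, val)
    return best

# n^2 times the minimal expected sample variance, starting from state 0:
min_n2_var = V(0, 0, 0.0)
print(min_n2_var)  # here 0.0: always taking action 0 yields constant rewards
```

Note that policies in this recursion may condition on the accumulated sum s as well as the chain state x, which is what makes the policy class "large"; an ordinary Markov policy would ignore s.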


