
Minimization of Variance on Controlled Markov Chain



Abstract

We consider a variance criterion on a finite-stage controlled Markov chain. The variance is the sample variance of the sequence of stage rewards (random variables) on the chain. Through a nonconventional dynamic programming approach, we minimize the expected value of the variance over a large policy class. For computational simplicity, we take the variance multiplied by the square of the total number of stages. Our invariant imbedding method expands the original state space by one dimension. We illustrate a two-state, two-action, two-stage model using a stochastic decision tree-table method. The optimal solution is obtained by solving the backward recursive equation for the minimum value functions on the expanded state spaces.
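The recursion described in the abstract can be sketched as follows. For $n$ stages with rewards $r_1,\dots,r_n$, the criterion $n^2 \times$ (sample variance) equals $n\sum_u r_u^2 - (\sum_u r_u)^2$; the additive first term is handled by ordinary dynamic programming, while the squared total requires carrying the accumulated reward sum $c$ as one extra state dimension — the invariant imbedding the abstract mentions. This is a minimal illustrative sketch, not the paper's actual model: the transition probabilities `P`, rewards `R`, and the assumption that the reward depends only on the current state and action are all made up for illustration.

```python
# Backward recursion on the expanded state space (s, c), where c is the
# accumulated reward sum. Value function:
#   V_t(s, c) = min_policy E[ N * sum_{u=t..N} r_u^2 - (c + sum_{u=t..N} r_u)^2 ]
# with terminal condition V_{N+1}(s, c) = -c^2, so that V_1(s0, 0) is the
# minimum expected value of N^2 * (sample variance of the stage rewards).

N = 2                                      # two stages
P = {                                      # P[s][a][s']: transition probabilities (illustrative)
    0: {0: [0.8, 0.2], 1: [0.3, 0.7]},
    1: {0: [0.5, 0.5], 1: [0.1, 0.9]},
}
R = {                                      # R[s][a]: stage reward (illustrative)
    0: {0: 1.0, 1: 2.0},
    1: {0: 0.0, 1: 3.0},
}

def V(t, s, c):
    """Minimum value function on the expanded state (s, c) at stage t."""
    if t > N:
        return -c * c                      # terminal term of the expanded criterion
    best = float("inf")
    for a in (0, 1):                       # two actions
        r = R[s][a]
        future = sum(P[s][a][s2] * V(t + 1, s2, c + r) for s2 in (0, 1))
        best = min(best, N * r * r + future)
    return best

# Minimum expected value of N^2 * sample variance, starting in state 0:
print(V(1, 0, 0.0))
```

Because the model is deliberately tiny (two states, two actions, two stages), plain recursion suffices; for longer horizons the set of reachable accumulated sums `c` grows, which is the computational price of the one extra dimension.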
