Successive approximations in partially observable controlled Markov chains with risk-sensitive average criterion

ROLANDO CAVAZOS-CADENA; DANIEL HERNANDEZ-HERNANDEZ

首页> 外文期刊>Stochastics: An International Journal of Probability and Stochastic Processes >Successive approximations in partially observable controlled Markov chains with risk-sensitive average criterion

【24h】

Successive approximations in partially observable controlled Markov chains with risk-sensitive average criterion

机译：具有风险敏感平均准则的部分可观察的受控马尔可夫链的逐次逼近

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Partially observable Markov decision chains with finite state, action and signal spaces are considered. The performance index is the risk-sensitive average criterion and, under conditions concerning reachability between the unobservable states and observability of the signals, it is shown that the value iteration algorithm can be implemented to approximate the optimal average cost, to determine a stationary policy whose performance index is arbitrarily close to the optimal one, and to establish the existence of solutions to the optimality equation. The results rely on an appropriate extension of the well-known Schweitzer's transformation.

机译：考虑具有有限状态，动作和信号空间的部分可观察的马尔可夫决策链。性能指标是风险敏感的平均标准，并且在涉及不可观察状态之间的可达性和信号的可观察性的条件下，表明可以使用值迭代算法来逼近最佳平均成本，从而确定其性能指标可任意接近最优指标，并建立最优方程解的存在性。结果依赖于著名的Schweitzer变换的适当扩展。

著录项

来源
《Stochastics: An International Journal of Probability and Stochastic Processes》 |2005年第6期|共32页
作者
ROLANDO CAVAZOS-CADENA; DANIEL HERNANDEZ-HERNANDEZ;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类概率论与数理统计;
关键词
Reduction to a completely observable model; Schweitzer's transformation; Equicontinuity of the value iteration functions; Birkhoff's distance; Lipschitz norm;

机译：归结为完全可观测的模型;Schweitzer变换;值迭代函数的等连续性;Birkhoff距离;Lipschitz范数;

相似文献

外文文献
中文文献
专利

1. Successive approximations in partially observable controlled Markov chains with risk-sensitive average criterion [J] . ROLANDO CAVAZOS-CADENA, DANIEL HERNANDEZ-HERNANDEZ Stochastics: An International Journal of Probability and Stochastic Processes . 2005,第6期

机译：具有风险敏感平均准则的部分可观察的受控马尔可夫链的逐次逼近
2. Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion [J] . Chavez-Rodriguez Selene, Cavazos-Cadena Rolando, Cruz-Suarez Hugo Journal of Optimization Theory and Applications . 2016,第2期

机译：具有风险敏感平均成本标准的受控半马尔可夫链
3. Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion [J] . Cavazos-Cadena R, Montes-De-Oca R Journal of Applied Probability . 2005,第4期

机译：具有风险敏感平均准则的受控马尔可夫链中的非平稳值迭代
4. Controlled Markov chains with risk-sensitive average cost criterion: the non-irreducible case [C] . Brau-Rojas, A., Fernandez-Gaucherand, . 2001

机译：具有风险敏感的平均成本准则的受控马尔可夫链：不可约案例
5. Controlled Markov chains with risk-sensitive average cost criterion. [D] . Brau Rojas, Agustin. 1999

机译：具有风险敏感平均成本准则的受控马尔可夫链。
6. Markov Chain Monte Carlo Inference of Parametric Dictionaries for Sparse Bayesian Approximations [O] . Theodora Chaspari, Andreas Tsiartas, Panagiotis Tsilifis, -1

机译：稀疏贝叶斯近似的参数字典的Markov Chain Monte Carlo推论
7. Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion [O] . Rolando Cavazos-Cadena, Raúl Montes-De-Oca 2005

机译：受控马尔可夫链中的非间断价值迭代，风险敏感平均标准
8. Partially Observable Markov Decision Processes with an Average Cost Criterion. [R] . Fernandex-Gaucherand, E., Arapostathis, A., Marcus, S. I. 1989

机译：具有平均成本准则的部分可观察马尔可夫决策过程。

Successive approximations in partially observable controlled Markov chains with risk-sensitive average criterion

摘要

著录项

相似文献

相关主题

期刊订阅