The Laurent series, sensitive discount and Blackwell optimality for continuous-time controlled Markov chains

Prieto-Rumeau T; Hernandez-Lerma O

Abstract

This paper gives conditions for the convergence of the Laurent series expansion for a class of continuous-time controlled Markov chains with possibly unbounded reward (or cost) rates and unbounded transition rates. That series is then used to study several optimization criteria, including n-discount optimality (for n = -1, 0, 1,...), Blackwell optimality, and the maximization of a certain vector criterion that in particular gives gain and bias optimality.
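For orientation, the objects named in the abstract can be written out as below. This is a sketch in standard notation for sensitive discount criteria, not a quotation of the paper's results: the symbols $V_\alpha(i,f)$ (expected $\alpha$-discounted reward of a stationary policy $f$ starting from state $i$), $h_k(i,f)$, $\alpha_0$ and $\alpha(i,f)$ are assumed here for illustration, and the exact hypotheses and policy classes are those stated in the paper itself.

```latex
% Laurent series expansion of the discounted reward in the discount rate alpha
% (assumed notation; h_{-1} plays the role of the gain, h_0 of the bias).
\[
  V_\alpha(i,f) \;=\; \sum_{k=-1}^{\infty} \alpha^{k}\, h_k(i,f),
  \qquad 0 < \alpha < \alpha_0 .
\]

% n-discount optimality of a policy f^* (for n = -1, 0, 1, ...):
\[
  \liminf_{\alpha \downarrow 0} \; \alpha^{-n}
  \bigl[ V_\alpha(i,f^*) - V_\alpha(i,f) \bigr] \;\ge\; 0
  \qquad \text{for all states } i \text{ and policies } f .
\]

% Blackwell optimality of f^*: for every state i and policy f there is
% some alpha(i,f) > 0 such that
\[
  V_\alpha(i,f^*) \;\ge\; V_\alpha(i,f)
  \qquad \text{for all } 0 < \alpha < \alpha(i,f) .
\]
```

Read this way, the case n = -1 compares policies through the leading coefficient $h_{-1}$ only, while larger n takes further coefficients of the expansion into account.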


Publication details

Source
Mathematical Methods of Operations Research | 2005, Issue 1 | 23 pages
Authors
Prieto-Rumeau T; Hernandez-Lerma O
Original format: PDF
Language: English
CLC classification: Mathematics
Keywords
continuous-time controlled Markov chains (also known as Markov decision processes); Laurent series; sensitive discount criteria; Blackwell optimality; average reward criteria; BOREL STATE-SPACE; UNBOUNDED REWARDS; DECISION CHAINS; DISCRETE-TIME; POLICIES


