A Numerical Method for the Evaluation of the Distribution of Cumulative Reward till Exit of a Subset of Transient States of a Markov Reward Model

Carrasco Juan A.; Sune Victor


Abstract

Markov reward models have interesting modeling applications, particularly those addressing fault-tolerant hardware/software systems. In this paper, we consider a Markov reward model with a reward structure including only reward rates associated with states, in which both positive and negative reward rates are present and null reward rates are allowed, and develop a numerical method to compute the distribution function of the cumulative reward till exit of a subset of transient states of the model. The method combines a model transformation step with the solution of the transformed model using a randomization construction with two randomization rates. The method introduces a truncation error, but that error is strictly bounded from above by a user-specified error control parameter. Further, the method is numerically stable and takes advantage of the sparsity of the infinitesimal generator of the transformed model. Using a Markov reward model of a fault-tolerant hardware/software system, we illustrate the application of the method and analyze its computational cost. Also, we compare the computational cost of the method with that of the (only) previously available method for the problem. Our numerical experiments seem to indicate that the new method can be efficient and that for medium size and large models can be substantially faster than the previously available method.
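The randomization idea mentioned in the abstract can be illustrated with a much simpler special case. The sketch below is not the authors' method: it uses standard randomization (uniformization) with a single randomization rate, a dense generator matrix, and no rewards. It only shows how a Poisson-weighted sum can be truncated so that the error stays below a user-specified `eps`. The function name `transient_probs` and its parameters are assumptions made for this illustration. If every state of the transient subset had reward rate 1, the cumulative reward till exit would equal the exit time, so the probability of having left the subset by time `t` would be the value of the distribution function at `t`.

```python
import math


def transient_probs(Q, pi0, t, eps=1e-10):
    """Standard randomization (uniformization) for a small CTMC.

    Q   : dense infinitesimal generator (list of rows); hypothetical input format
    pi0 : initial probability vector
    t   : time horizon
    eps : bound on the total truncation error (keep it well above machine precision)

    Illustrative sketch only: one randomization rate, no reward rates,
    no sparse storage, and a naive Poisson recursion that underflows
    when rate * t is large.
    """
    n = len(Q)
    rate = max(-Q[i][i] for i in range(n))  # randomization rate
    if rate == 0.0 or t == 0.0:
        return list(pi0)

    # Transition matrix of the randomized discrete-time chain: P = I + Q / rate
    P = [[(1.0 if i == j else 0.0) + Q[i][j] / rate for j in range(n)]
         for i in range(n)]

    weight = math.exp(-rate * t)      # Poisson probability of 0 jumps
    acc = weight                      # accumulated Poisson mass
    v = list(pi0)                     # distribution after k jumps
    result = [weight * x for x in v]

    k = 0
    while 1.0 - acc > eps:            # neglected Poisson tail bounds the error
        k += 1
        v = [sum(v[i] * P[i][j] for i in range(n)) for j in range(n)]
        weight *= rate * t / k
        acc += weight
        result = [r + weight * x for r, x in zip(result, v)]
    return result


# Two-state example: state 0 is transient with exit rate 2, state 1 is absorbing.
probs = transient_probs([[-2.0, 2.0], [0.0, 0.0]], [1.0, 0.0], t=0.5)
print("P[exit by t = 0.5] is approximately", probs[1])
```

In this example the exit probability should agree with the closed form 1 - exp(-1) up to the chosen `eps`. Handling positive and negative reward rates, as the paper does, requires the model transformation and the two-rate construction described in the abstract, which this sketch does not attempt.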


Bibliographic details

Source
Dependable and Secure Computing, IEEE Transactions on | 2011, No. 6 | pp. 798-809 | 12 pages
Authors
Carrasco Juan A.; Sune Victor
Affiliation
Universitat Politècnica de Catalunya, Barcelona
Original format: PDF
Text language: English
Keywords
Fault tolerance; Markov reward models; modeling techniques; numerical algorithms

Similar literature

1. Two methods for computing bounds for the distribution of cumulative reward for large Markov models [J]. Juan A. Carrasco. Performance Evaluation. 2006, No. 12
2. A fast algorithm for the transient reward distribution in continuous-time Markov chains [J]. Tijms HC., Veldman R. Operations Research Letters: A Journal of the Operations Research Society of America. 2000, No. 4
3. Transient Analysis of Idle Time in VANETs Using Markov-Reward Models [J]. Martin-Faus Isabel V., Urquiza-Aguiar Luis, Aguilar Igartua Monica. Fortschritte der Physik. 2018, No. 4
4. Transient distributions of cumulative rate and impulse based reward with applications [C]. Edmundo De Souza E. Silva, H. Richard Gail, Joao Carlos Guedes. IFIP TC7 Conference. 2000
5. Online Controlled Experiment Design: Trade-off Between Statistical Uncertainty and Cumulative Reward [D]. Dai, Liang. 2014
6. Learning to maximize reward rate: a model based on semi-Markov decision processes [O]. Arash Khodadadi, Pegah Fakhari, Jerome R. Busemeyer. 2014
7. A Numerical method for the evaluation of the distribution of cumulative reward till exit of a Subset of transient states of a Markov reward model [O]. Carrasco, Juan A., Suñé, Víctor. 2011
8. Transient Analysis of Markov and Markov Reward Models [R]. Trivedi, K., Reibman, A., Smith, R. 1988
