This paper addresses the following basic feasibility problem for infinite-horizon Markov decision processes (MDPs): can one find a policy that achieves a specified value (target) of the long-run limiting average reward at a specified probability level (percentile)? The related optimization problems of maximizing the target for a fixed percentile, and vice versa, are also considered. The authors present a complete (and discrete) classification of both the maximal achievable target levels and their corresponding percentiles, and they provide an algorithm for computing a deterministic policy corresponding to any feasible target-percentile pair. They then consider similar problems for an MDP with multiple rewards and/or constraints; this case presents some difficulties and leads to several open problems, although an LP-based formulation provides constructive solutions for most cases.
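To make the feasibility question concrete, the following is a minimal sketch of how one LP-based check might look in the simplest setting. It uses the classical occupation-measure linear program for the optimal long-run average reward of a unichain MDP, where every policy induces the same recurrent class, so the limiting average reward is deterministic and a target is feasible at every percentile exactly when it does not exceed the LP optimum. The MDP data, the function name, and the target value below are all illustrative assumptions, not the paper's construction, which also covers the multichain case where the percentile classification is nontrivial.

```python
import numpy as np
from scipy.optimize import linprog

# Hypothetical 2-state, 2-action unichain MDP (illustrative data only).
S, A = 2, 2
P = np.zeros((S, A, S))            # P[s, a, s'] = transition probability
P[0, 0] = [0.9, 0.1]
P[0, 1] = [0.2, 0.8]
P[1, 0] = [0.7, 0.3]
P[1, 1] = [0.05, 0.95]
r = np.array([[1.0, 0.0],          # r[s, a] = one-step reward
              [2.0, 3.0]])

def max_average_reward(P, r):
    """Solve the occupation-measure LP:
         maximize  sum_{s,a} x(s,a) r(s,a)
         s.t.      sum_a x(s',a) = sum_{s,a} P(s'|s,a) x(s,a)  for all s'
                   sum_{s,a} x(s,a) = 1,  x >= 0
       Its optimum is the best achievable long-run average reward."""
    S, A = r.shape
    n = S * A
    idx = lambda s, a: s * A + a
    A_eq = np.zeros((S + 1, n))
    for sp in range(S):                      # stationarity (flow balance)
        for a in range(A):
            A_eq[sp, idx(sp, a)] += 1.0
        for s in range(S):
            for a in range(A):
                A_eq[sp, idx(s, a)] -= P[s, a, sp]
    A_eq[S, :] = 1.0                         # normalization to a distribution
    b_eq = np.zeros(S + 1)
    b_eq[S] = 1.0
    res = linprog(c=-r.flatten(), A_eq=A_eq, b_eq=b_eq, method="highs")
    return -res.fun                          # linprog minimizes, so negate

target = 2.0                                 # illustrative target level
value = max_average_reward(P, r)
print(f"optimal average reward {value:.4f}; target {target} feasible: {value >= target}")
```

In this unichain example the LP optimum is attained by a deterministic stationary policy (a vertex of the occupation-measure polytope), which mirrors the paper's result that feasible target-percentile pairs can be met by deterministic policies.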