Hierarchical algorithms for discounted and weighted Markov decision processes

M. Abbad; C. Daoui

首页> 外文期刊>Mathematical methods of operations research >Hierarchical algorithms for discounted and weighted Markov decision processes

【24h】

Hierarchical algorithms for discounted and weighted Markov decision processes

机译：折现和加权马尔可夫决策过程的层次算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider a discrete time finite Markov decision process (MDP) with the discounted and weighted reward optimality criteria. In [1] the authors considered some decomposition of limiting average MDPs. In this paper, we use an analogous approach for discounted and weighted MDPs. Then, we construct some hierarchical decomposition algorithms for both discounted and weighted MDPs.

机译：我们考虑具有折扣和加权奖励最优标准的离散时间有限马尔可夫决策过程（MDP）。在[1]中，作者考虑了极限平均MDP的一些分解。在本文中，我们对折现和加权MDP使用类似的方法。然后，我们为折价和加权MDP构造了一些层次分解算法。

著录项

来源
《Mathematical methods of operations research》 |2003年第2期|共9页
作者
M. Abbad; C. Daoui;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类数学;
关键词
discounted MDP; weighted MDP; decomposition; strongly connected classes; graph theory;

机译：折现MDP;加权MDP;分解;强关联类;图论;

相似文献

外文文献
中文文献
专利

1. Hierarchical algorithms for discounted and weighted Markov decision processes [J] . M. Abbad, C. Daoui Mathematical methods of operations research . 2003,第2期

机译：折现和加权马尔可夫决策过程的层次算法
2. Actor-critic algorithms for hierarchical Markov decision processes [J] . Bhatnagar S, Panigrahi JR Automatica . 2006,第4期

机译：层次马尔可夫决策过程的参与者评论算法
3. Simplex Algorithm for Countable-State Discounted Markov Decision Processes [J] . Lee Ilbin, Epelman Marina A., Romeijn H. Edwin, Operations Research: The Journal of the Operations Research Society of America . 2017,第4期

机译：单纯x可数状态折扣马尔可夫决策过程的算法
4. Weighted difference approximation of value functions for slow-discounting Markov Decision Processes [C] . Yin-Lam Chow, Junjie Qin IEEE Annual Conference on Decision and Control . 2014

机译：慢折扣马尔可夫决策过程的值函数的加权差分近似
5. Increasing scalability in algorithms for centralized and decentralized partially observable Markov decision processes: Efficient decision-making and coordination in uncertain environments. [D] . Amato, Christopher. 2010

机译：用于集中式和分散式部分可观察的马尔可夫决策过程的算法中的可伸缩性不断增强：在不确定的环境中进行有效的决策和协调。
6. Classification of bioinformatics workflows using weighted versions of partitioning and hierarchical clustering algorithms [O] . Etienne Lord, Abdoulaye Baniré Diallo, Vladimir Makarenkov 2015

机译：使用分区和分层聚类算法的加权版本对生物信息学工作流进行分类
7. Iteration Algorithms in Markov Decision Processes with State- Action-Dependent Discount Factors and Unbounded Costs [O] . Fernando Luque-Vásquez, J. Adolfo Minjárez-Sosa 2016

机译：Markov决策过程中的迭代算法，具有依赖折扣因子和无限性成本

Hierarchical algorithms for discounted and weighted Markov decision processes

摘要

著录项

相似文献

相关主题

期刊订阅