A Modified Value Iteration Algorithm for Discounted Markov Decision Processes

Sanaa Chafik; Cherki Daoui


Abstract

Because many real applications involve a large number of states, classical methods are intractable for solving large Markov Decision Processes. Decomposition techniques based on the topology of each state in the associated graph, together with parallelization, are useful ways to cope with this problem. In this paper, the authors propose a Modified Value Iteration algorithm that adds parallelism. They test their implementation on artificial data using OpenMP, which yields a significant speed-up.
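The abstract does not give the details of the authors' modification, so the following is only a minimal sketch of standard synchronous value iteration for a discounted MDP, not the paper's algorithm. The function name `value_iteration`, the data layout of `P` and `R`, and the two-state example are all assumptions made for illustration. The sketch is meant to show why the method parallelizes naturally: each state's Bellman backup reads only the previous value vector, so the per-state loop has independent iterations.

```python
def value_iteration(P, R, gamma, eps=1e-8, max_iter=10_000):
    """Synchronous (Jacobi-style) value iteration for a discounted MDP.

    Assumed layout (hypothetical, for this sketch only):
      P[s][a] is a list of (next_state, probability) pairs,
      R[s][a] is the immediate reward for taking action a in state s.
    """
    n = len(P)

    def q_value(V, s, a):
        # Expected discounted return of taking action a in state s.
        return R[s][a] + gamma * sum(p * V[t] for t, p in P[s][a])

    V = [0.0] * n
    for _ in range(max_iter):
        # Every backup below reads only the old V, so the iterations over s
        # are independent. This is the loop a parallel implementation would
        # split across threads.
        V_new = [max(q_value(V, s, a) for a in range(len(P[s]))) for s in range(n)]
        delta = max(abs(x - y) for x, y in zip(V_new, V))
        V = V_new
        if delta < eps:
            break

    # Greedy policy with respect to the final value estimate.
    policy = [max(range(len(P[s])), key=lambda a: q_value(V, s, a)) for s in range(n)]
    return V, policy


# Hypothetical two-state example: state 1 is absorbing with reward 2 per step.
P = [
    [[(0, 1.0)], [(1, 1.0)]],  # state 0: action 0 stays, action 1 moves to state 1
    [[(1, 1.0)]],              # state 1: single action, stays
]
R = [
    [0.0, 1.0],
    [2.0],
]
V, policy = value_iteration(P, R, gamma=0.5)
```

With a discount factor of 0.5, the fixed point of this small example can be checked by hand: V(1) = 2 / (1 - 0.5) = 4 and V(0) = 1 + 0.5 * 4 = 3, so the greedy policy moves from state 0 to state 1.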


Bibliographic details

Source: Journal of Electronic Commerce in Organizations, 2015, No. 3, pp. 47-57 (11 pages)
Authors: Sanaa Chafik; Cherki Daoui
Affiliation (both authors): Laboratory of Information Processing and Decision Support, University Sultan Moulay Slimane, Beni Mellal, Morocco
Original format: PDF
Language of text: English
Keywords: Discounted Reward Criterion; Markov Decision Processes; OpenMP; Parallelizing; Value Iteration Algorithm
Added to database: 2022-08-17 13:38:37

