Tree-based fitted Q-iteration for multi-objective Markov decision processes in water resource management

F. Pianosi; A. Castelletti; M. Restelli

首页> 外文期刊>Journal of Hydroinformatics >Tree-based fitted Q-iteration for multi-objective Markov decision processes in water resource management

【24h】

Tree-based fitted Q-iteration for multi-objective Markov decision processes in water resource management

机译：水资源管理中多目标马尔可夫决策过程的基于树的拟合Q迭代

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multi-objective Markov decision processes (MOMDPs) provide an effective modeling framework for decision-making problems involving water systems. The traditional approach is to define many single-objective problems (resulting from different combinations of the objectives), each solvable by standard optimization. This paper presents an approach based on reinforcement learning (RL) that can learn the operating policies for all combinations of objectives in a single training process. The key idea is to enlarge the approximation of the action-value function, which is performed by single-objective RL over the state-action space, to the space of the objectives' weights. The batch-mode nature of the algorithm allows for enriching the training dataset without further interaction with the controlled system. The approach is demonstrated on a numerical test case study and evaluated on a real-world application, the Hoa Binh reservoir, Vietnam. Experimental results on the test case show that the proposed approach (multi-objective fitted Q-iteration; MOFQl) becomes computationally preferable over the repeated application of its single-objective version (fitted Q-iteration; FQi) when evaluating more than five weight combinations. In the Hoa Binh case study, the operating policies computed with MOFQl and FQI have comparable efficiency, while MOFQl provides a continuous

机译：多目标马尔可夫决策过程（MOMDP）为涉及供水系统的决策问题提供了有效的建模框架。传统方法是定义许多单目标问题（由目标的不同组合导致），每个问题都可以通过标准优化来解决。本文提出了一种基于强化学习（RL）的方法，该方法可以在单个培训过程中学习目标的所有组合的操作策略。关键思想是将在状态作用空间上由单目标RL执行的作用值函数的逼近扩大到目标权重的空间。该算法的批处理模式性质允许在不与受控系统进一步交互的情况下丰富训练数据集。该方法在数值测试案例研究中得到证明，并在越南Hoa Binh水库的实际应用中进行了评估。在测试用例上的实验结果表明，在评估五个以上权重组合时，所提出的方法（多目标拟合Q迭代； MOFQ1）在计算上优于单目标版本（拟合Q迭代； FQi）的重复应用。在Hoa Binh案例研究中，使用MOFQ1和FQI计算的操作策略具有可比的效率，而MOFQ1提供了连续的

著录项

来源
《Journal of Hydroinformatics》 |2013年第2期|258-270|共13页
作者
F. Pianosi; A. Castelletti; M. Restelli;
展开▼
作者单位

Dipartimento di Elettronica e Informazione,Politecnico di Milano Piazza L da Vinci,32,1-20133 Milano,Italy;

Dipartimento di Elettronica e Informazione,Politecnico di Milano Piazza L da Vinci,32,1-20133 Milano,Italy;

Dipartimento di Elettronica e Informazione,Politecnico di Milano Piazza L da Vinci,32,1-20133 Milano,Italy;

展开▼
收录信息美国《科学引文索引》(SCI);
原文格式 PDF
正文语种 eng
中图分类
关键词
multi-objective optimization; optimal control; reinforcement learning; reservoir operation; tree-based models;

机译：多目标优化;最佳控制;强化学习;水库作业;基于树的模型;

相似文献

外文文献
中文文献
专利

1. Markov decision processes in natural resources management: Observability and uncertainty [J] . Byron K. Williams Ecological Modelling . 2009,第6期

机译：马尔可夫自然资源管理中的决策过程：可观察性和不确定性
2. Application of a Multi-Person and Multi-Objective Decision-Making Model in Groundwater Resources Management [J] . Guo Li Wang, Li Li Huang, Guo Hua Liang Journal of hydrologic engineering . 2012,第3期

机译：多人多目标决策模型在地下水资源管理中的应用
3. A spatial multi-objective decision-making under uncertainty for water resources management [J] . Slobodan P. Simonovic, Nirupama Journal of Hydroinformatics . 2005,第2期

机译：不确定条件下水资源管理的空间多目标决策
4. Tree-based Fitted Q-iteration for Multi-Objective Markov Decision problems [C] . Castelletti Andrea, Pianosi Francesca, Restelli Marcello Neural Networks (IJCNN), The 2012 International Joint Conference on . 2012

机译：多目标马尔可夫决策问题的基于树的拟合Q迭代
5. Use of multi-objective particle swarm optimization in water resources management [D] . Baltar, Alexandre Moreira 2007

机译：多目标粒子群算法在水资源管理中的应用
6. Multi-Objective Markov Decision Processes for Data-Driven Decision Support [O] . Daniel J. Lizotte, Eric B. Laber -1

机译：数据驱动决策支持的多目标马尔可夫决策过程
7. Tree-based Fitted Q-iteration for Multi-Objective Markov Decision problems [O] . A. Castelletti, F. Pianosi, M. Restelli 2012

机译：多目标马尔可夫决策问题的基于树的拟合Q迭代
8. Two Short Notes on Markov Processes: I. A Test for Sub-Optimal Actions in Markovian Decision Problems. II. An Intrinsically Determined Markov Chain [R] . MacQueen, J. B. 1966

机译：关于马尔可夫过程的两个简短说明：I。马尔可夫决策问题中次优最优行动的检验。 II。本质上确定的马尔可夫链

Tree-based fitted Q-iteration for multi-objective Markov decision processes in water resource management

摘要

著录项

相似文献

相关主题

期刊订阅