...
首页> 外文期刊>Water Resources Management >Deep Reinforcement Learning for Cascaded Hydropower Reservoirs Considering Inflow Forecasts
【24h】

Deep Reinforcement Learning for Cascaded Hydropower Reservoirs Considering Inflow Forecasts

机译:考虑流入预测的级联水电站水库深度加固学习

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper develops a deep reinforcement learning (DRL) framework for intelligence operation of cascaded hydropower reservoirs considering inflow forecasts, in which two key problems of large discrete action spaces and uncertainty of inflow forecasts are addressed. In this study, a DRL framework is first developed based on a newly defined knowledge sample form and a deep Q-network (DQN). Then, an aggregation-disaggregation model is used to reduce the multi-dimension spaces of state and action for cascaded reservoirs. Following, three DRL models are developed respectively to evaluate the performance of the newly defined decision value functions and modified decision action selection approach. In this paper, the DRL methodologies are tested on China's Hun River cascade hydropower reservoirs system. The results show that the aggregation-disaggregation model can effectively reduce the dimensions of state and action, which also makes the model structure simpler and has higher learning efficiency. The Bayesian theory in the decision action selection approach is useful to address the uncertainty of inflow forecasts, which can improve the performance to reduce spillages during the wet season. The proposed DRL models outperform the comparison models (i.e., stochastic dynamic programming) in terms of annual hydropower generation and system reliability. This study suggests that the DRL has the potential to be implemented in practice to derive optimal operation strategies.
机译:本文开发了考虑流入预测的级联水电站智能运行的深增强学习(DRL)框架,其中解决了大型离散行动空间的两个关键问题和流入预测的不确定性。在本研究中,首先基于新定义的知识样本和深Q网络(DQN)开发DRL框架。然后,使用聚合 - 分组模型来减少级联储存器的状态和动作的多维空间。以下,分别开发了三种DRL模型,以评估新定义的决策值函数和修改决策动作选择方法的性能。本文在中国浑河级联水电站系统测试了DRL方法。结果表明,聚集 - 分组模型可以有效地降低状态和动作的尺寸,这也使模型结构更简单并具有更高的学习效率。决策措施选择方法中的贝叶斯理论可用于解决流入预测的不确定性,这可以提高潮湿季节减少溢出的性能。所提出的DRL模型在年度水电站和系统可靠性方面优于比较模型(即随机动态编程)。本研究表明,DRL有可能在实践中实施以推导出最佳的运作策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号