首页> 外文期刊>Journal of Hydrology >A game theory-reinforcement learning (GT-RL) method to develop optimal operation policies for multi-operator reservoir systems
【24h】

A game theory-reinforcement learning (GT-RL) method to develop optimal operation policies for multi-operator reservoir systems

机译:博弈论强化学习(GT-RL)方法,用于为多运营商储层系统开发最优运营策略

获取原文
获取原文并翻译 | 示例
       

摘要

Reservoir systems with multiple operators can benefit from coordination of operation policies. To maximize the total benefit of these systems the literature has normally used the social planner's approach. Based on this approach operation decisions are optimized using a multi-objective optimization model with a compound system's objective. While the utility of the system can be increased this way, fair allocation of benefits among the operators remains challenging for the social planner who has to assign controversial weights to the system's beneficiaries and their objectives. Cooperative game theory provides an alternative framework for fair and efficient allocation of the incremental benefits of cooperation. To determine the fair and efficient utility shares of the beneficiaries, cooperative game theory solution methods consider the gains of each party in the status quo (non-cooperation) as well as what can be gained through the grand coalition (social planner's solution or full cooperation) and partial coalitions. Nevertheless, estimation of the benefits of different coalitions can be challenging in complex multibeneficiary systems. Reinforcement learning can be used to address this challenge and determine the gains of the beneficiaries for different levels of cooperation, i.e., non-cooperation, partial cooperation, and full cooperation, providing the essential input for allocation based on cooperative game theory. This paper develops a game theory-reinforcement learning (GT-RL) method for determining the optimal operation policies in multi-operator multi-reservoir systems with respect to fairness and efficiency criteria. As the first step to underline the utility of the GT-RL method in solving complex multi-agent multi-reservoir problems without a need for developing compound objectives and weight assignment, the proposed method is applied to a hypothetical three-agent three-reservoir system. (C) 2014 Elsevier B.V. All rights reserved.
机译:具有多个运营商的储层系统可以受益于运营政策的协调。为了使这些系统的总利益最大化,文献通常使用社会计划者的方法。基于这种方法,使用具有复合系统目标的多目标优化模型来优化操作决策。尽管可以通过这种方式增加系统的效用,但是对于必须为系统的受益人及其目标分配有争议的权重的社会计划者,在运营商之间公平分配利益仍然具有挑战性。合作博弈理论为公平有效地分配合作的增量利益提供了另一种框架。为了确定受益人的公平和有效的效用份额,合作博弈理论的解决方法考虑了当事方在现状(不合作)下的收益以及通过大联盟(社会计划者的解决方案或充分合作)所能获得的收益。 )和部分联盟。然而,在复杂的多受益人系统中,估计不同联盟的利益可能是具有挑战性的。强化学习可用于应对这一挑战并确定受益人在不同程度的合作(即不合作,部分合作和完全合作)下的收益,从而为基于合作博弈理论进行分配提供了必要的输入。本文开发了一种基于博弈论的强化学习(GT-RL)方法,用于根据公平性和效率标准确定多运营商多水库系统的最优运营策略。作为强调使用GT-RL方法解决复杂的多主体多水库问题而无需制定复合目标和权重分配的第一步,该方法被应用于假设的三主体三水库系统。 (C)2014 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号