首页> 外文OA文献 >Deep Multi-Agent Reinforcement Learning using DNN-Weight Evolution to Optimize Supply Chain Performance
【2h】

Deep Multi-Agent Reinforcement Learning using DNN-Weight Evolution to Optimize Supply Chain Performance

机译:利用DNN权重进化深度多智能体强化学习优化供应链绩效

摘要

To develop a supply chain management (SCM) system that performs optimally for both each entity in the chain and the entire chain, a multi-agent reinforcement learning (MARL) technique has been developed. To solve two problems of the MARL for SCM (building a Markov decision processes for a supply chain and avoiding learning stagnation in a way similar to the "prisoner's dilemma"), a learning management method with deep-neural-network (DNN)-weight evolution (LM-DWE) has been developed. By using a beer distribution game (BDG) as an example of a supply chain, experiments with a four-agent system were performed. Consequently, the LM-DWE successfully solved the above two problems and achieved 80.0% lower total cost than expert players of the BDG.
机译:为了开发对链中每个实体和整个链均表现最佳的供应链管理(SCM)系统,已开发了多主体强化学习(MARL)技术。为了解决SCM的MARL的两个问题(为供应链建立马尔可夫决策过程并以类似于“囚徒困境”的方式避免学习停滞),这是一种具有深度神经网络(DNN)权重的学习管理方法进化(LM-DWE)已经开发出来。通过使用啤酒分销游戏(BDG)作为供应链的示例,进行了四人系统的实验。因此,LM-DWE成功解决了上述两个问题,并且总成本比BDG的专家参与者降低了80.0%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号