...
首页> 外文期刊>Future generation computer systems >Multi-agent deep learning for simultaneous optimization for time and energy in distributed routing system
【24h】

Multi-agent deep learning for simultaneous optimization for time and energy in distributed routing system

机译:多主体深度学习可同时优化分布式路由系统中的时间和精力

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Packet routing problem most commonly emerges in the context of computer networks, thus the majority of routing algorithms existing nowadays is designed specifically for routing in computer networks. However, in the logistics domain, many problems can be formulated in terms of packet routing, e.g. in automated traffic routing or material handling systems. In this paper, we propose an algorithm for packet routing in such heterogeneous environments. Our approach is based on deep reinforcement learning networks combined with link-state protocol and preliminary supervised learning. Similarly to most routing algorithms, the proposed algorithm is a distributed one and is designed to run on a network constructed from interconnected routers. Unlike most other algorithms, proposed one views routers as learning agents, representing the routing problem as a multi-agent reinforcement learning problem. Modeling each router as a deep neural network allows each router to account for heterogeneous data about its environment, allowing for optimization of more complex cost functions, like in case of simultaneous optimization of bag delivery time and energy consumption in a baggage handling system. We tested the algorithm using manually constructed simulation models of computer network and baggage handling system. It outperforms state-of-the-art routing algorithms. (C) 2018 Elsevier B.V. All rights reserved.
机译:分组路由问题最普遍出现在计算机网络的环境中,因此,当今存在的大多数路由算法都是专门为计算机网络中的路由而设计的。然而,在物流领域,可以根据分组路由来提出许多问题,例如在自动交通路线或物料处理系统中。在本文中,我们提出了一种在这种异构环境中进行分组路由的算法。我们的方法基于结合了链接状态协议和初步监督学习的深度强化学习网络。与大多数路由算法相似,该算法是一种分布式算法,旨在在由互连路由器构成的网络上运行。与大多数其他算法不同,提出的人将路由器视为学习代理,将路由问题表示为多代理强化学习问题。将每个路由器建模为一个深度神经网络,可以使每个路由器考虑到有关其环境的异构数据,从而可以优化更复杂的成本函数,例如在行李处理系统中同时优化行李运送时间和能耗的情况下。我们使用计算机网络和行李处理系统的手动构建的仿真模型对算法进行了测试。它优于最新的路由算法。 (C)2018 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号