Journal of Parallel and Distributed Computing

Feasible enhancements to congestion control in InfiniBand-based networks

Abstract

The interconnection network architecture is crucial for High-Performance Computing (HPC) clusters, since it must meet the increasing computing demands of applications. Current trends in the design of these networks are based on increasing link speed while reducing latency and the number of components in order to lower cost. The InfiniBand Architecture (IBA) is an example of a powerful interconnect technology, delivering huge amounts of information in a few microseconds. IBA-based hardware is able to deliver EDR and HDR speeds (i.e., 100 and 200 Gb/s, respectively). Unfortunately, congestion situations and the problems derived from them (i.e., Head-of-Line (HoL) blocking and buffer hogging) are a serious threat to the performance of both the interconnection network and the entire HPC cluster. In this paper, we propose a new approach to provide IBA-based networks with techniques for reducing congestion problems. We propose Flow2SL-ITh, a technique that combines a static queuing scheme (SQS) with the closed-loop congestion control mechanism included in IBA-based hardware (a.k.a. injection throttling, ITh). Flow2SL-ITh separates traffic flows by storing them in different virtual lanes (VLs) in order to reduce HoL blocking, while the injection rate of congested flows is throttled. Until the congested traffic vanishes, there is no buffer sharing among traffic flows stored in different VLs, which reduces the negative effects of congestion. We have implemented Flow2SL-ITh in OpenSM, the open-source implementation of the IBA subnet manager (SM). Experimental results obtained by running simulations and real workloads in a small IBA cluster show that Flow2SL-ITh outperforms existing techniques by up to 44% under some traffic scenarios.
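
To make the HoL-blocking argument concrete, the following is a minimal toy sketch, not the Flow2SL-ITh implementation. It assumes a single output port with one FIFO queue per VL, a hypothetical flow-to-VL mapping (destination modulo the number of VLs, which is not claimed to be the mapping policy used by Flow2SL), and a congested destination whose packets cannot advance at all. Injection throttling is not modeled. The names `assign_vl` and `delivered_packets` are illustrative only.

```python
from collections import deque


def assign_vl(dst, num_vls):
    # Hypothetical static mapping (an assumption, not the paper's policy):
    # destination id modulo the number of virtual lanes.
    return dst % num_vls


def delivered_packets(dsts, num_vls, congested_dst, cycles):
    """Count packets forwarded in `cycles` cycles from per-VL FIFO queues.

    A packet addressed to `congested_dst` never advances, so it blocks
    every packet queued behind it in the same VL (HoL blocking).
    """
    queues = [deque() for _ in range(num_vls)]
    for dst in dsts:
        queues[assign_vl(dst, num_vls)].append(dst)

    delivered = 0
    for _ in range(cycles):
        # Forward at most one packet per cycle, scanning VLs in fixed order.
        for q in queues:
            if q and q[0] != congested_dst:
                q.popleft()
                delivered += 1
                break
    return delivered


if __name__ == "__main__":
    traffic = [0, 1, 2, 3, 1, 3]  # destination ids; destination 0 is congested
    for vls in (1, 2, 4):
        print(vls, delivered_packets(traffic, vls, congested_dst=0, cycles=10))
```

In this toy model, a single shared queue lets the packet for the congested destination stall all traffic behind it, whereas spreading flows over more VLs confines the stall to the VL that holds the congested flow.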
