Journal: Concurrency, practice and experience

Scheduling data streams for low latency and high throughput on a Cray XC40 using Libfabric

Abstract

Achieving efficient many-to-many communication on a given network topology is challenging when many data streams from different sources must be scattered concurrently to many destinations with low variance in arrival times. In such scenarios, it is critical to saturate, but not congest, the bisection bandwidth of the network in order to achieve good aggregate throughput. When there are many concurrent point-to-point connections, the communication pattern needs to be scheduled dynamically and at fine granularity to avoid network congestion (links, switches), overload of a node's incoming links, and receive-buffer overflow. Motivated by the use case of the Compressed Baryonic Matter (CBM) experiment, we study the performance and variance of such communication patterns on a Cray XC40 with different routing schemes and scheduling approaches. We present a distributed Data Flow Scheduler (DFS) that reduces the variance of arrival times from all sources by a factor of at least 30 and increases the achieved aggregate bandwidth by up to 50%.
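
The overload of a node's incoming links mentioned in the abstract can be illustrated with a toy model. The sketch below is not the DFS described in the paper; it only contrasts two static send orders under simplifying assumptions (equal numbers of sources and destinations, fixed time slots, one message per source per slot). All function names are hypothetical and chosen for illustration.

```python
from collections import Counter


def naive_schedule(n_nodes):
    """Every source walks the destinations in the same order (0, 1, 2, ...).

    Returns a list of time slots; each slot is a list of (source, destination).
    """
    return [[(src, slot) for src in range(n_nodes)] for slot in range(n_nodes)]


def staggered_schedule(n_nodes):
    """Source s targets destination (s + slot) % n_nodes: a cyclic shift per source."""
    return [
        [(src, (src + slot) % n_nodes) for src in range(n_nodes)]
        for slot in range(n_nodes)
    ]


def max_incast(schedule):
    """Largest number of sources hitting the same destination within one slot."""
    return max(
        max(Counter(dst for _, dst in slot).values())
        for slot in schedule
    )


if __name__ == "__main__":
    n = 8
    print(max_incast(naive_schedule(n)), max_incast(staggered_schedule(n)))
    # prints: 8 1
```

Both orders deliver every source-destination pair exactly once, but the naive order sends all sources to the same destination in each slot, while the cyclic shift gives each destination one sender per slot. A real scheduler would also have to react to routing, link contention, and buffer state at run time, which this static model ignores.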
