首页> 外文期刊>ACM transactions on computer systems >Cluster Communication Protocols for Parallel-Programming Systems
【24h】

Cluster Communication Protocols for Parallel-Programming Systems

机译:并行编程系统的群集通信协议

获取原文
获取原文并翻译 | 示例

摘要

Clusters of workstations are a popular platform for high-performance computing. For many parallel applications, efficient use of a fast interconnection network is essential for good performance. Several modern System Area Networks include programmable network interfaces that can be tailored to perform protocol tasks that otherwise would need to be done by the host processors. Finding the right trade-off between protocol processing at the host and the network interface is difficult in general. In this work, we systematically evaluate the performance of different implementations of a single, user-level communication interface. The implementations make different architectural assumptions about the reliability of the network and the capabilities of the network interface. The implementations differ accordingly in their division of protocol tasks between host software, network-interface firmware, and network hardware. Also, we investigate the effects of alternative data-transfer methods and multicast implementations, and we evaluate the influence of packet size. Using microbenchmarks, parallel-programming systems, and parallel applications, we assess the performance of the different implementations at multiple levels. We use two hardware platforms with different performance characteristics to validate our conclusions. We show how moving protocol tasks to a relatively slow network interface can yield both performance advantages and disadvantages, depending on specific characteristics of the application and the underlying parallel-programming system.
机译:工作站集群是用于高性能计算的流行平台。对于许多并行应用程序,有效使用快速互连网络对于获得良好性能至关重要。几种现代的系统区域网络包括可编程网络接口,可以对该接口进行定制以执行协议任务,否则主机处理器将需要完成这些任务。通常很难在主机和网络接口的协议处理之间找到合适的折衷方案。在这项工作中,我们系统地评估单个用户级通信接口的不同实现的性能。这些实现对网络的可靠性和网络接口的功能做出了不同的体系结构假设。相应的实现方式在主机软件,网络接口固件和网络硬件之间的协议任务划分上有所不同。此外,我们研究了替代数据传输方法和多播实现的影响,并评估了数据包大小的影响。使用微基准测试,并行编程系统和并行应用程序,我们可以在多个级别评估不同实现的性能。我们使用两个具有不同性能特征的硬件平台来验证我们的结论。我们将展示如何将协议任务移至相对较慢的网络接口,这取决于应用程序和底层并行编程系统的特定特性,这会同时带来性能优势和劣势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号