Supercomputing, 1998. SC98. IEEE/ACM Conference on

User-Space Communication: A Quantitative Study

Abstract

Powerful commodity systems and networks offer a promising direction for high-performance computing because they are inexpensive and closely track technology progress. However, the high raw performance of the hardware is rarely delivered to the end user. Previous work has shown that the bottleneck in these architectures is the overhead imposed by the software communication layer. To reduce this overhead, researchers have proposed a number of user-space communication models. The common feature of these models is that applications have direct access to the network, bypassing the operating system in the common case and thus avoiding the cost of send/receive system calls. In this paper we examine five user-space communication layers that represent different points in the configuration space: Generic AM, BIP-0.92, FM-2.02, PM-1.2, and VMMC-2. Although these systems support different communication paradigms and employ a variety of implementation tradeoffs, we are able to compare them quantitatively on a single testbed consisting of a cluster of high-end PCs connected by a Myrinet network. We find that all five communication systems have very low latency for small messages, in the range of 5 to 17 µs. Not surprisingly, this range is strongly influenced by the functionality offered by each system. We are encouraged, however, to find that features such as protected and reliable communication at user level and multiprogramming can be provided at very low cost. Bandwidth, in contrast, depends primarily on how data is transferred between host memory and the network. Most of the investigated libraries support zero-copy protocols for certain types of data transfers, but they differ significantly in the bandwidth delivered to end users. The highest bandwidth, between 95 and 125 MBytes/s for long message transfers, is delivered by libraries that use DMA on both the send and receive sides and avoid all data copies. Libraries that perform additional data copies or use programmed I/O to send data to the network achieve lower maximum bandwidth, in the range of 60-70 MBytes/s.
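
To see how the latency and bandwidth figures quoted above interact, the following minimal sketch uses a simple linear cost model (fixed per-message latency plus message size divided by peak bandwidth). This model is an illustrative assumption and is not taken from the paper; the function names are hypothetical, and 1 MByte is assumed to be 10^6 bytes so that 1 MByte/s equals 1 byte/µs.

```python
# Illustrative linear cost model (an assumption, not the paper's methodology):
#   time = latency + size / peak_bandwidth
# With 1 MByte = 10**6 bytes, 1 MByte/s is exactly 1 byte per microsecond.


def transfer_time_us(size_bytes, latency_us, peak_mbytes_per_s):
    """Estimated one-way transfer time in microseconds."""
    return latency_us + size_bytes / peak_mbytes_per_s


def effective_bandwidth(size_bytes, latency_us, peak_mbytes_per_s):
    """Delivered bandwidth in MBytes/s for a message of the given size."""
    return size_bytes / transfer_time_us(size_bytes, latency_us, peak_mbytes_per_s)


def half_power_size(latency_us, peak_mbytes_per_s):
    """Message size in bytes at which half of the peak bandwidth is delivered."""
    return latency_us * peak_mbytes_per_s


if __name__ == "__main__":
    # Endpoints of the ranges quoted in the abstract, paired arbitrarily
    # for illustration; the pairs do not correspond to specific libraries.
    for latency_us, peak in [(5.0, 125.0), (17.0, 60.0)]:
        n_half = half_power_size(latency_us, peak)
        print(f"latency={latency_us} us, peak={peak} MBytes/s, "
              f"half-power size={n_half:.0f} bytes")
```

Under this model, small messages are dominated by the fixed latency term, while long transfers approach the peak bandwidth, which is consistent with the abstract reporting latency for small messages and bandwidth for long ones separately.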