Conference: 2010 IEEE International Conference on Cluster Computing

TCCluster: A Cluster Architecture Utilizing the Processor Host Interface as a Network Interconnect

Abstract

So far, large computing clusters consisting of several thousand machines have been constructed by connecting nodes with interconnect technologies such as Ethernet, InfiniBand, or Myrinet. We propose an entirely new architecture called Tightly Coupled Cluster (TCCluster) that instead uses the native host interface of the processors as a direct network interconnect. By virtually integrating the network interface adapter into the processor, this approach offers higher bandwidth and much lower communication latencies than the traditional approaches. Our technique is purely software based: it neither modifies the processor nor requires any additional hardware. Instead, we use commodity off-the-shelf AMD processors and exploit the HyperTransport host interface as a cluster interconnect. In this paper, we explain the addressing of nodes in such a cluster, the routing within such a system, and the programming model that can be applied. We present a detailed description of the tasks that need to be addressed and provide a proof-of-concept implementation. To evaluate our technique, we present a two-node TCCluster prototype, for which we developed the BIOS firmware, a custom Linux kernel, and a small message library. Our microbenchmarks show a sustained bandwidth of up to 2500 MB/s for messages as small as 64 bytes and a communication latency of 227 ns between two nodes, outperforming other high-performance networks by an order of magnitude.