TCCluster: A Cluster Architecture Utilizing the Processor Host Interface as a Network Interconnect

机译：TCCluster：使用处理器主机接口作为网络互连的群集体系结构

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

So far, large computing clusters consisting of several thousand machines have been constructed by connecting nodes together using interconnect technologies as e.g. Ethernet, Infiniband or Myrinet. We propose an entirely new architecture called Tightly Coupled Cluster (TCCluster) that instead uses the native host interface of the processors as a direct network interconnect. This approach offers higher bandwidth and much lower communication latencies than the traditional approaches by virtually integrating the network interface adapter into the processor. Our technique neither applies any modifications to the processor nor requires any additional hardware. Instead, we use commodity off the shelf AMD processors and exploit the HyperTransport host interface as a cluster interconnect. Our approach is purely software based and does not require any additional hardware nor modifications to the existing processors. In this paper, we explain the addressing of nodes in such a cluster, the routing within such a system and the programming model that can be applied. We present a detailed description of the tasks that need to be addressed and provide a proof of concept implementation. For the evaluation of our technique a two node TCCluster prototype is presented. Therefore, the BIOS firmware, a custom Linux kernel and a small message library has been developed. We present microbenchmarks that show a sustained bandwidth of up to 2500 MB/s for messages as small as 64 Byte and a communication latency of 227 ns between two nodes outperforming other high performance networks by an order of magnitude.

机译：到目前为止，已经通过使用互连技术将节点连接在一起而构建了由数千台机器组成的大型计算集群。以太网，Infiniband或Myrinet。我们提出了一种全新的体系结构，称为紧密耦合群集（TCCluster），该体系结构将处理器的本机主机接口用作直接的网络互连。通过将网络接口适配器虚拟集成到处理器中，该方法提供了比传统方法更高的带宽和更低的通信延迟。我们的技术既不对处理器进行任何修改，也不需要任何其他硬件。取而代之的是，我们使用现成的AMD处理器，并利用HyperTransport主机接口作为集群互连。我们的方法完全基于软件，不需要任何额外的硬件，也不需要修改现有的处理器。在本文中，我们解释了此类集群中节点的寻址，此类系统内的路由以及可应用的编程模型。我们对需要解决的任务进行了详细描述，并提供了概念证明。为了评估我们的技术，提出了一个两节点TCCluster原型。因此，已经开发了BIOS固件，自定义Linux内核和小型消息库。我们提出了微基准测试，对于小至64字节的消息，它显示出高达2500 MB / s的持续带宽，并且两个节点之间的通信延迟为227 ns，其性能比其他高性能网络高出一个数量级。

著录项

来源
《2010 IEEE International Conference on Cluster Computing》|2010年|p.9-18|共10页
会议地点
作者
Litz Heiner; Thuermer Maximilian; Bruening Ulrich;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类分子生物学;
关键词
AMD; HPC; HyperTransport; Low latency; Opteron; high bandwidth; interconnect;

机译：AMD; HPC;超传输;低延迟; Opteron;高带宽;互连;

相似文献

外文文献
中文文献
专利

1. Synthesis of Predictable Networks-on-Chip-Based Interconnect Architectures for Chip Multiprocessors [J] . Murali S., Atienza D., Meloni P., IEEE transactions on very large scale integration (VLSI) systems . 2007,第8期

机译：面向芯片多处理器的基于可预测片上网络的互连体系结构综合
2. Cluster Based Networks-on-Chip: An Efficient and Fault-Tolerant Architecture using Network Interface Assisted Routing [J] . Khalid Latif, Amir-Mohammad Rahmani, Tiberiu Seceleanu, International journal of adaptive, resilient, and autonomic systems . 2013,第3期

机译：基于群集的片上网络：使用网络接口辅助路由的高效且容错的体系结构
3. Scalable ATM network interface design using parallel RISC processors architecture [J] . Ali Elkateeb, Paul Richardson, Adnan Shaout, Microprocessors and microsystems . 2004,第9期

机译：使用并行RISC处理器体系结构的可扩展ATM网络接口设计
4. TCCluster: A Cluster Architecture Utilizing the Processor Host Interface as a Network Interconnect [C] . Litz Heiner, Thuermer Maximilian, Bruening Ulrich 2010 IEEE International Conference on Cluster Computing . 2010

机译：TCCluster：使用处理器主机接口作为网络互连的群集体系结构
5. Modeling and analysis of router architectures and network interface architecture for network on chip. [D] . Singh, Sanjay Pratap. 2006

机译：片上网络的路由器体系结构和网络接口体系结构的建模和分析。
6. Photo-induced transformation process at gold clusters-semiconductor interface: Implications for the complexity of gold clusters-based photocatalysis [O] . Siqi Liu, Yi-Jun Xu -1

机译：金团簇-半导体界面处的光诱导转变过程：对基于金团簇的光催化的复杂性的影响
7. A Hexagonal Processor and Interconnect Topology for Many-Core Architecture with Dense On-Chip Networks [O] . Xiao, Zhibin, Baas, Bevan 2012

机译：具有密集片上网络的多核体系结构的六边形处理器和互连拓扑

TCCluster: A Cluster Architecture Utilizing the Processor Host Interface as a Network Interconnect

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅