首页> 外文会议>Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2012 IEEE International >A 4.5Tb/s 3.4Tb/s/W 64×64 switch fabric with self-updating least-recently-granted priority and quality-of-service arbitration in 45nm CMOS
【24h】

A 4.5Tb/s 3.4Tb/s/W 64×64 switch fabric with self-updating least-recently-granted priority and quality-of-service arbitration in 45nm CMOS

机译:4.5Tb / s 3.4Tb / s / W 64×64交换矩阵,具有自更新的最近授予的最低优先级和45nm CMOS的服务质量仲裁

获取原文
获取原文并翻译 | 示例

摘要

High-speed and low-power routers form the basic building blocks of on-die interconnect fabrics that are critical to overall throughput and energy efficiency of high performance systems. Conventional routers use distinct logic blocks for routing data and handling arbitration. At higher radices, connections between these blocks become a bottleneck, limiting router scalability and degrading performance. Recently, two switch topologies merged the data routing fabric with arbitration control, avoiding this bottleneck. However, relies on centralized control for channel allocation, limiting performance, while restricted to a small set of fixed priorities, rendering input ports prone to starvation. In addition, ever larger CMPs will require continued increases in bandwidth over previous designs. To address these issues, we present a 64x64 single-stage swizzle-switch network (SSN) with 128b data buses (8192 total input/output wires). The SSN can connect any input to any output, including multicast. It has a peak measured throughput of 4.5Tb/s at 1.1V in 45nm SOI CMOS at 25°C. The SSN''s key features are: 1) a single-cycle least-recently granted (LRG) priority arbitration technique that reuses the already present input and output data buses and their drivers and sense amps; 2) an additional 4-level message-based priority arbitration for quality of service (QoS) with 2% logic and 3% wiring overhead; 3) a bidirectional bitline repeater that allows the router to scale to >;8000 wires. These features result in a compact fabric (4.06mm2) with throughput gain of 2.1 x over at 3.4Tb/s/W efficiency, which improves to 7.4Tb/s/W at 600mV.
机译:高速和低功耗路由器构成了片上互连结构的基本构建模块,这些模块对高性能系统的整体吞吐量和能效至关重要。常规路由器使用不同的逻辑块来路由数据和处理仲裁。在更高的半径下,这些模块之间的连接成为瓶颈,从而限制了路由器的可扩展性并降低了性能。最近,两种交换拓扑将数据路由结构与仲裁控制合并在一起,避免了此瓶颈。但是,通道分配依赖于集中控制,从而限制了性能,同时又限制了一小部分固定优先级,从而使输入端口容易饿死。另外,与以前的设计相比,更大的CMP将需要不断增加带宽。为了解决这些问题,我们提出了一个具有128b数据总线(总共8192条输入/输出线)的64x64单级拨动开关网络(SSN)。 SSN可以将任何输入连接到任何输出,包括多播。在25°C的45nm SOI CMOS中,其在1.1V时的峰值测量吞吐量为4.5Tb / s。 SSN的主要功能是:1)单周期最近授予(LRG)优先级仲裁技术,该技术可重用已经存在的输入和输出数据总线及其驱动器和检测放大器; 2)附加的基于4层消息的优先级仲裁,用于2%逻辑和3%布线开销的服务质量(QoS); 3)双向位线中继器,允许路由器扩展到>; 8000线。这些功能使结构紧凑(4.06mm2),在3.4Tb / s / W的效率下,吞吐量提高了2.1倍,在600mV时提高到7.4Tb / s / W。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号