A 4.5Tb/s 3.4Tb/s/W 64×64 switch fabric with self-updating least-recently-granted priority and quality-of-service arbitration in 45nm CMOS

机译：4.5Tb / s 3.4Tb / s / W 64×64交换矩阵，具有自更新的最近授予的最低优先级和45nm CMOS的服务质量仲裁

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

High-speed and low-power routers form the basic building blocks of on-die interconnect fabrics that are critical to overall throughput and energy efficiency of high performance systems. Conventional routers use distinct logic blocks for routing data and handling arbitration. At higher radices, connections between these blocks become a bottleneck, limiting router scalability and degrading performance. Recently, two switch topologies merged the data routing fabric with arbitration control, avoiding this bottleneck. However, relies on centralized control for channel allocation, limiting performance, while restricted to a small set of fixed priorities, rendering input ports prone to starvation. In addition, ever larger CMPs will require continued increases in bandwidth over previous designs. To address these issues, we present a 64x64 single-stage swizzle-switch network (SSN) with 128b data buses (8192 total input/output wires). The SSN can connect any input to any output, including multicast. It has a peak measured throughput of 4.5Tb/s at 1.1V in 45nm SOI CMOS at 25°C. The SSN''s key features are: 1) a single-cycle least-recently granted (LRG) priority arbitration technique that reuses the already present input and output data buses and their drivers and sense amps; 2) an additional 4-level message-based priority arbitration for quality of service (QoS) with 2% logic and 3% wiring overhead; 3) a bidirectional bitline repeater that allows the router to scale to >;8000 wires. These features result in a compact fabric (4.06mm2) with throughput gain of 2.1 x over at 3.4Tb/s/W efficiency, which improves to 7.4Tb/s/W at 600mV.

机译：高速和低功耗路由器构成了片上互连结构的基本构建模块，这些模块对高性能系统的整体吞吐量和能效至关重要。常规路由器使用不同的逻辑块来路由数据和处理仲裁。在更高的半径下，这些模块之间的连接成为瓶颈，从而限制了路由器的可扩展性并降低了性能。最近，两种交换拓扑将数据路由结构与仲裁控制合并在一起，避免了此瓶颈。但是，通道分配依赖于集中控制，从而限制了性能，同时又限制了一小部分固定优先级，从而使输入端口容易饿死。另外，与以前的设计相比，更大的CMP将需要不断增加带宽。为了解决这些问题，我们提出了一个具有128b数据总线（总共8192条输入/输出线）的64x64单级拨动开关网络（SSN）。 SSN可以将任何输入连接到任何输出，包括多播。在25°C的45nm SOI CMOS中，其在1.1V时的峰值测量吞吐量为4.5Tb / s。 SSN的主要功能是：1）单周期最近授予（LRG）优先级仲裁技术，该技术可重用已经存在的输入和输出数据总线及其驱动器和检测放大器； 2）附加的基于4层消息的优先级仲裁，用于2％逻辑和3％布线开销的服务质量（QoS）； 3）双向位线中继器，允许路由器扩展到>; 8000线。这些功能使结构紧凑（4.06mm2），在3.4Tb / s / W的效率下，吞吐量提高了2.1倍，在600mV时提高到7.4Tb / s / W。

著录项

来源
《Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2012 IEEE International》|2012年|p.478- 480|共3页
会议地点 San Francisco CA(US)
作者
Satpathy S.; Sewell K.; Manville T.; Yen-Po Chen; Dreslinski R.; Sylvester D.; Mudge T.; Blaauw D.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类半导体集成电路（固体电路）;
关键词

相似文献

外文文献
专利

1. 640-Gb/s high-speed ATM switching system based on 0.25-μm CMOS,MCM-C, and optical WDM interconnection [J] . Yamanaka N., Kawano R., Oki E., IEEE Transactions on Advanced Packaging . 2002,第1期

机译：基于0.25μmCMOS，MCM-C和光学WDM互连的640-Gb / s高速ATM交换系统
2. Comparative study of dual-pulsed 1064 nm Q-switched Nd:YAG laser and single-pulsed 1064 nm Q-switched Nd: YAG laser by using zebrafish model and prospective split-face analysis of facial melasma [J] . Jang Hee Won, Chun Seung Hyun, Park Hae Chul, Journal of cosmetic and laser therapy . 2017,第2期

机译：双脉冲1064nm Q开关Nd：YAG激光和单脉冲1064nm Q开关Nd：YAG激光使用斑马鱼模型的比较研究，面部截鱼的前瞻性分裂分析
3. A 640 imes 640 Fully Dynamic CMOS Image Sensor for Always-On Operation [J] . Park Injun, Jo Woojin, Park Chanmin, IEEE Journal of Solid-State Circuits . 2020,第4期

机译：640 Times 640全动态CMOS图像传感器，可始终开启操作
4. A 4.5Tb/s 3.4Tb/s/W 64#x00D7;64 switch fabric with self-updating least-recently-granted priority and quality-of-service arbitration in 45nm CMOS [C] . Satpathy S., Sewell K., Manville T., IEEE International Solid-State Circuits Conference . 2012

机译：4.5tb / s 3.4tb / s / w 64＆＃x00d7; 64个开关面料，具有自我更新最近授予的优先级和45nm CMOS的优先级仲裁
5. 57-64GHz CMOS transmitter with embedded artificial dieletric for frequency hopping and quasi -continuous phase control in antenna beam steering [D] . Gu, Qun 2007

机译：具有嵌入式人工电介质的57-64 GHz CMOS发射机，用于天线波束控制中的跳频和准连续相位控制
6. Insight into ECMO mortality and ARDS: a nationwide analysis of 45647 ECMO runs [O] . Benjamin Friedrichson, Haitham Mutlak, Kai Zacharowski, 2021

机译：洞察Ecmo死亡率和ARDS：45647欧洲杂草运行的全国范围内
7. CMOS Terahertz Metamaterial Based 64 × 64 Bolometric Detector Arrays [O] . Escorcia Carranza Ivonne, Grant James, Gough John, 2017

机译：基于CmOs太赫兹超材料的64×64测温探测器阵列

A 4.5Tb/s 3.4Tb/s/W 64×64 switch fabric with self-updating least-recently-granted priority and quality-of-service arbitration in 45nm CMOS

摘要

著录项

相似文献

相关主题

期刊订阅