Prediction Router: A Low-Latency On-Chip Router Architecture with Multiple Predictors

Matsutani Hiroki; Koibuchi Michihiro; Amano Hideharu; Yoshinaga Tsutomu

首页> 外文期刊>Computers, IEEE Transactions on >Prediction Router: A Low-Latency On-Chip Router Architecture with Multiple Predictors

【24h】

Prediction Router: A Low-Latency On-Chip Router Architecture with Multiple Predictors

机译：预测路由器：具有多个预测器的低延迟片上路由器架构

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Multi and many-core applications are sensitive to interprocessor communication latencies, suggesting the need for low-latency on-chip networks. We propose a low-latency router architecture that predicts the output channel to be used by the next packet transfer and speculatively completes the switch arbitration to reduce communication latency. The packets coming into the prediction routers are transferred without waiting for the routing computation and switch arbitration if the prediction hits. Thus, the primary concern for reducing communication latency is the hit rates of the prediction algorithms, which vary based on network environments, such as the network topology, routing algorithm, and traffic pattern. Although typical low-latency routers that skip one or more pipeline stages use a bypass data path that is based on a static or single bypassing policy (e.g., accelerating the packets moving in the same dimension), our prediction router architecture predictively forwards packets based on the prediction algorithm selected from among several candidates in response to the network environment. We analyze the prediction hit rates of five prediction algorithms on meshes, tori, fat trees, and Spidergons. Then, we present four case studies, each of which assumes different many-core architectures. We implemented the prediction routers for each case study by using a 45 nm CMOS process, and evaluated them in terms of the prediction hit rate, zero-load latency, hardware amount, and energy consumption. A typical prediction router with two or three predictors shows that although the area and energy are increased by 4.8-12.0 percent and 5.3 percent, respectively, up to 89.8 percent of the prediction hit rate is achieved in real applications, which provides favorable trade-offs between modest hardware/energy overheads and significant latency saving.

机译：多核和多核应用程序对处理器间通信延迟很敏感，这表明需要低延迟的片上网络。我们提出了一种低延迟路由器体系结构，该体系结构可预测下一个数据包传输将使用的输出通道，并推测性地完成交换机仲裁以减少通信延迟。如果预测命中，则进入预测路由器的数据包无需等待路由计算和交换机仲裁就可以传输。因此，减少通信等待时间的主要考虑因素是预测算法的命中率，该命中率根据网络环境（例如网络拓扑，路由算法和流量模式）而变化。尽管跳过一个或多个流水线级的典型低延迟路由器使用基于静态或单个旁路策略的旁路数据路径（例如，加速以相同维度移动的数据包），但我们的预测路由器体系结构会根据响应于网络环境从多个候选对象中选择的预测算法。我们分析了网格，花托，胖树和蜘蛛上的五种预测算法的预测命中率。然后，我们提出了四个案例研究，每个案例研究都采用不同的多核架构。我们使用45 nm CMOS工艺为每个案例研究实现了预测路由器，并根据预测命中率，零负载延迟，硬件数量和能耗对它们进行了评估。具有两个或三个预测器的典型预测路由器显示，尽管面积和能量分别增加了4.8-12.0％和5.3％，但在实际应用中可以达到高达89.8％的预测命中率，这提供了有利的权衡适度的硬件/能源开销与显着的延迟节省之间。

著录项

来源
《Computers, IEEE Transactions on》 |2011年第6期|p.783-799|共17页
作者
Matsutani Hiroki; Koibuchi Michihiro; Amano Hideharu; Yoshinaga Tsutomu;
展开▼
作者单位

The University of Tokyo, Tokyo;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Interconnection networks; low-latency router architecture.; on-chip networks;

机译：互连网络;低延迟路由器体系结构;片上网络;

相似文献

外文文献
中文文献
专利

1. A Low-Latency, High-Throughput On-Chip Optical Router Architecture for Future Chip Multiprocessors [J] . MARK J. CIANCHETTI, DAVID H. ALBONESI ACM Journal on Emerging Technologies in Computing Systems . 2011,第2期

机译：适用于未来芯片多处理器的低延迟，高吞吐量的片上光路由器架构
2. A practical low-latency router architecture with wing channel for on-chip network [J] . Mingche Lai, Lei Gao, Sheng Ma, Microprocessors and microsystems . 2011,第2期

机译：实用的低延迟路由器架构，带有用于片上网络的wing通道
3. Low-Latency Virtual-Channel Routers for On-Chip Networks [J] . Robert Mullins, Andrew West, Simon Moore Computer architecture news . 2004,第2期

机译：片上网络的低延迟虚拟通道路由器
4. Prediction Router: Yet Another Low Latency On-Chip Router Architecture [C] . Hiroki Matsutani, Michihiro Koibuchi, Hideharu Amano, International Symposium on High Performance Computer Architecture . 2009

机译：预测路由器：又一个低延迟的片断路由器架构
5. Experimental Evaluation and Comparison of Time-Multiplexed Multi-FPGA Routing Architectures [D] . Kashif, Asmeen. 2017

机译：时间多路复用多FPGA路由架构的实验评估和比较
6. Dynamic Bus Travel Time Prediction Models on Road with Multiple Bus Routes [O] . Cong Bai, Zhong-Ren Peng, Qing-Chang Lu, 2015

机译：多条公交路线的公交车动态行驶时间预测模型
7. Low-Latency Virtual-Channel Routers for On-Chip Networks [O] . Robert Mullins, Andrew West, Simon Moore 2004

机译：用于片上网络的低延迟虚拟通道路由器
8. Integrated Deployment Architecture for Predictive Real-Time Traffic Routing Incorporation Human Factors Considerations. NEXTRANS Project No. 080PY04. [R] . Song, D., He, X., Peeta, S., 2014

机译：用于预测性实时流量路由的集成部署体系结构纳入人为因素考虑因素。 NEXTRaNs项目编号080pY04。

Prediction Router: A Low-Latency On-Chip Router Architecture with Multiple Predictors

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅