首页> 外文期刊>ACM transactions on reconfigurable technology and systems >Fast and Cycle-Accurate Emulation of Large-Scale Networks-on-Chip Using a Single FPGA

【24h】

Fast and Cycle-Accurate Emulation of Large-Scale Networks-on-Chip Using a Single FPGA

机译：使用单个FPGA的大规模片上网络的快速，精确周期仿真

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Modeling and simulation/emulation play a major role in research and development of novel Networks-on-Chip (NoCs). However, conventional software simulators are so slow that studying NoCs for emerging many-core systems with hundreds to thousands of cores is challenging. State-of-the-art FPGA-based NoC emulators have shown great potential in speeding up the NoC simulation, but they cannot emulate large-scale NoCs due to the FPGA capacity constraints. Moreover, emulating large-scale NoCs under synthetic workloads on FPGAs typically requires a large amount of memory and thus involves the use of off-chip memory, which makes the overall design much more complicated and may substantially degrade the emulation speed. This article presents methods for fast and cycle-accurate emulation of NoCs with up to thousands of nodes using a single FPGA. We first describe how to emulate a NoC under a synthetic workload using only FPGA on-chip memory (BRAMs). We next present a novel use of time-division multiplexing where BRAMs are effectively used for emulating a network using a small number of nodes, thereby overcoming the FPGA capacity constraints. We propose methods for emulating both direct and indirect networks, focusing on the commonly used meshes and fat-trees (k-ary n-trees). This is different from prior work that considers only direct networks. Using the proposed methods, we build a NoC emulator, called FNoC, and demonstrate the emulation of some mesh-based and fat-tree-based NoCs with canonical router architectures. Our evaluation results show that (1) the size of the largest NoC that can be emulated depends on only the FPGA on-chip memory capacity; (2) a mesh-based NoC with 16,384 nodes (128 x 128 NoC) and a fat-tree-based NoC with 6,144 switch nodes and 4,096 terminal nodes (4-ary 6-tree NoC) can be emulated using a single Virtex-7 FPGA; and (3) when emulating these two NoCs, we achieve, respectively, 5,047x and 232x speedups over BookSim, one of the most widely used software-based NoC simulators, while maintaining the same level of accuracy.

机译：建模和仿真/仿真在新型片上网络（NoC）的研发中起着重要作用。但是，传统的软件模拟器太慢了，以至于研究新兴的具有数百至数千个内核的多核系统的NoC颇具挑战。基于FPGA的最先进的NoC仿真器在加速NoC仿真方面显示出了巨大的潜力，但是由于FPGA的容量限制，它们无法仿真大规模的NoC。此外，在FPGA上的合成工作负载下仿真大规模NoC通常需要大量的存储器，因此涉及使用片外存储器，这使整体设计更加复杂，并可能大大降低仿真速度。本文介绍了使用单个FPGA对多达数千个节点的NoC进行快速，精确周期仿真的方法。我们首先描述如何仅使用FPGA片上存储器（BRAM）在综合工作负载下仿真NoC。接下来，我们提出一种时分复用的新颖用法，其中BRAM有效地用于模拟使用少量节点的网络，从而克服了FPGA的容量限制。我们提出了直接和间接网络的仿真方法，重点是常用的网格和胖树（k元n树）。这不同于仅考虑直接网络的先前工作。使用提出的方法，我们构建了一个称为FNoC的NoC仿真器，并演示了使用规范路由器体系结构对某些基于网格和基于胖树的NoC的仿真。我们的评估结果表明：（1）可以仿真的最大NoC的大小仅取决于FPGA片上存储器的容量；（2）可以使用单个Virtex-V仿真具有16384个节点（128 x 128 NoC）的基于网格的NoC和具有6,144个交换节点和4,096个终端节点（4进制6树的NoC）的基于胖树的NoC。 7 FPGA; （3）在模拟这两个NoC时，我们分别比BookSim（基于软件的NoC模拟器使用最广泛的软件之一）获得了5,047x和232x的加速，同时保持了相同的准确性。

著录项

来源
《ACM transactions on reconfigurable technology and systems》 |2017年第4期|27.1-27.27|共27页
作者
Van Chu Thiem; Sato Shimpei; Kise Kenji;
展开▼
作者单位

Tokyo Inst Technol, Dept Comp Sci, Meguro Ku, 2-12-1-W8-79 Ookayama, Tokyo 1528550, Japan;

Tokyo Inst Technol, Dept Informat & Commun Engn, Meguro Ku, 2-12-1-S3-58 Ookayama, Tokyo 1528550, Japan;

Tokyo Inst Technol, Dept Comp Sci, Meguro Ku, 2-12-1-W8-79 Ookayama, Tokyo 1528550, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Emulation; FPGA; many-core; network-on-chip;

机译：仿真;FPGA;多核;片上网络;

相似文献

外文文献
中文文献
专利

1. Trace-Driven Emulation of Large-Scale Networks-on-Chip on FPGAs [J] . Thiem Van CHU, Kenji KISE 電子情報通信学会技術研究報告. リコンフィギャラブルシステム. Reconfigurable Systems . 2016,第417期

机译：FPGA上的大规模片上网络的跟踪驱动仿真
2. Trace-Driven Emulation of Large-Scale Networks-on-Chip on FPGAs [J] . Thiem Van CHU, Kenji KISE 電子情報通信学会技術研究報告. VLSI設計技術. VLSI Design Technologies . 2016,第415期

机译：FPGA上的大规模片上网络的跟踪驱动仿真
3. Trace-Driven Emulation of Large-Scale Networks-on-Chip on FPGAs [J] . Thiem Van CHU, Kenji KISE 電子情報通信学会技術研究報告. コンピュ-タシステム. Computer Systems . 2016,第416期

机译：FPGA上的大规模片上网络的跟踪驱动仿真
4. Enabling Fast and Accurate Emulation of Large-Scale Network on Chip Architectures on a Single FPGA [C] . Thiem Van Chu, Sato Shimpei, Kise Kenji International Symposium on Field-Programmable Custom Computing Machines . 2015

机译：在单个FPGA上实现大规模片上网络架构的快速，准确仿真
5. Large-Scale Real-Time Electromagnetic Transient Simulation of Power Systems Using Hardware Emulation on FPGAs. [D] . Chen, Yuan. 2012

机译：使用FPGA上的硬件仿真的电力系统大规模实时电磁暂态仿真。
6. NEBULA is a fast negative binomial mixed model for differential or co-expression analysis of large-scale multi-subject single-cell data [O] . Liang He, Jose Davila-Velderrain, Tomokazu S. Sumida, 2021

机译：星云是大规模多对象单细胞数据的差分或共表达分析的快速负二项式混合模型
7. A Fault Injection Methodology and Infrastructure for Fast Single Event Upsets Emulation on Xilinx SRAM-based FPGAs [O] . Di Carlo Stefano, Prinetto Paolo Ernesto, Rolfo D., 2014

机译：基于Xilinx SRAM的FPGA的快速单事件翻转仿真的故障注入方法和基础设施

获取原文

客服邮箱：kefu@zhangqiaokeyan.com

京公网安备：11010802029741号 ICP备案号：京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有

客服微信
服务号