...
【24h】

Trace-Driven Emulation of Large-Scale Networks-on-Chip on FPGAs

机译:FPGA上的大规模片上网络的跟踪驱动仿真

获取原文
获取原文并翻译 | 示例

摘要

Research and development of large-scale Networks-on-Chip (NoCs) play a key role in designing future many-core systems but are challenging due to the lack of fast and accurate evaluation environments. In recent years, there have been several attempts to build NoC emulation systems using FPGAs. These studies have shown promising emulation speedups of up to several orders of magnitude compared with conventional software simulators. However, emulating large-scale NoCs with hundreds to thousands of nodes on FPGAs is a challenging problem because of the FPGA capacity constraints. Moreover, supporting trace-driven workloads captured from practical applications may drastically degrade the emulation speed, especially when the emulated NoC is large, because the trace data are typically very large and thus stored in off-chip memory (usually DRAM). In this paper, we first present estimates of the DRAM capacity and bandwidth required for emulating NoCs under trace-driven workloads. We next show that although the limitation of DRAM bandwidth can be alleviated by storing trace data to DRAM in order of the emulation cycle and sequentially loading them during the emulation, a cyclic dependency between trace data and the network may occur in some certain cases. Finally, we describe our approach to enable trace-driven emulation of large-scale NoCs on FPGAs. In our previous work, to overcome the FPGA capacity constraints, we have proposed an emulation method based on the time-division multiplexing technique where a NoC is emulated using a small number of physical nodes. In this work, we show that this emulation method also makes it much easier to achieve the DRAM bandwidth requirement when emulating large-scale NoCs under trace-driven workloads. We explain in detail how trace data are stored in DRAM and fed to the emulated NoC efficiently and show some preliminary evaluation results.
机译:大规模片上网络(NoC)的研究与开发在设计未来的多核系统中起着关键作用,但由于缺乏快速,准确的评估环境,因此具有挑战性。近年来,已经进行了几次尝试使用FPGA构建NoC仿真系统。这些研究表明,与传统的软件模拟器相比,仿真速度提高了几个数量级。但是,由于FPGA的容量限制,在FPGA上模拟具有数百至数千个节点的大规模NoC是一个具有挑战性的问题。此外,支持从实际应用中捕获的跟踪驱动的工作负载可能会极大地降低仿真速度,尤其是在仿真的NoC很大时,因为跟踪数据通常很大,因此存储在片外存储器(通常是DRAM)中。在本文中,我们首先介绍了在跟踪驱动的工作负载下模拟NoC所需的DRAM容量和带宽的估计。接下来我们表明,尽管可以通过按照仿真周期的顺序将跟踪数据存储到DRAM并在仿真过程中顺序加载它们来减轻DRAM带宽的限制,但是在某些情况下,跟踪数据和网络之间可能存在循环依赖性。最后,我们描述了在FPGA上实现跟踪驱动的大规模NoC仿真的方法。在我们之前的工作中,为了克服FPGA的容量限制,我们提出了一种基于时分复用技术的仿真方法,其中使用少量物理节点来仿真NoC。在这项工作中,我们表明,在跟踪驱动的工作负载下模拟大型NoC时,这种仿真方法还使满足DRAM带宽要求变得更加容易。我们将详细解释跟踪数据如何存储在DRAM中并有效地馈送到仿真NoC,并显示一些初步评估结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号