首页> 外文期刊>Computing >FPGA friendly NoC simulation acceleration framework employing the hard blocks
【24h】

FPGA friendly NoC simulation acceleration framework employing the hard blocks

机译:FPGA友好的NOC仿真加速框架,采用硬块

获取原文
获取原文并翻译 | 示例

摘要

A major role is played by Modeling and Simulation platforms in development of a new Network-on-Chip (NoC) architecture. The cycle accurate software simulators tend to become slow when simulating thousands of cores on a single chip. FPGAs have become the vehicle for simulation acceleration due to the properties of parallelism. Most of the state-of-the-art FPGA based NoC simulators utilize soft logic only for modeling the NoCs, leaving out the hard blocks to be unutilized. In this work, the FIFO Buffer and Crossbar switch functionalities of the NoC router have been embedded in the Block RAM (BRAMs) and the DSP48E1 slices with large multiplexer respectively. Employing the proposed techniques of mapping the NoC router components on the FPGA hard blocks, an NoC simulation acceleration framework based on the FPGA is presented in this work. A huge reduction in the use of the Configurable Logic Blocks (CLBs) has been observed when the FIFO buffer and Crossbar components of the NoC topology's router micro-architecture are embedded in FPGA hard blocks. Our experimental results show that the topologies implemented employing the proposed FPGA friendly mapping of the NoC router components on the hard blocks consume 43.47% fewer LUTs and 41.66% fewer FFs than the topologies with CLB implementation. To optimize the latency of the NoC under consideration, a control unit called "buf_empty_checker" has been employed. A reduction in average latency has been observed compared to the CLB based topology implementation employing the proposed mapping. The proposed work consumes 10.88% fewer LUTs than the CONNECT NoC generation tool. Compared to DART, a reduction of 73.38% and 66.55% in LUTs and FFs has been observed with respect to the proposed work. The average packet latency of the proposed NoC architecture is 24.8% and 19.1% lesser than the CONNECT and DART architectures.
机译:通过开发新的片上(NOC)架构的建模和仿真平台来扮演主要作用。当在单个芯片上模拟数千个核心时,循环精确的软件模拟器往往会变慢。由于平行性的性质,FPGA已成为模拟加速的车辆。基于最先进的FPGA的NOC模拟器利用SOFT逻辑来建模NOC,留出了要联合的硬块。在这项工作中,NOC路由器的FIFO缓冲区和横杆交换机功能分别嵌入在块RAM(BRAM)和具有大多路复用器的DSP48E1切片中。采用在FPGA硬块上映射NOC路由器组件的建议技术,在这项工作中提出了基于FPGA的NOC仿真加速框架。当NOC拓扑路由器微架构的FIFO缓冲区和横杆组件嵌入在FPGA硬块中时,已经观察到使用可配置逻辑块(CLB)的巨大减少。我们的实验结果表明,在硬块上采用所提出的FPGA友好映射所实施的拓扑结构比使用CLB实施的拓扑造成43.47%的LUT和41.66%,而不是41.66%。为了优化所考虑的NOC的延迟,已经采用了一个名为“buf_empty_checker”的控制单元。与采用所提出的映射的基于CLB的拓扑实现相比,已经观察到平均延迟的减小。拟议的工作消耗了比Connect NoC代成工具更少的LUT减少了10.88%。与Dart相比,已经遵守拟议工作的LUT和FFS减少73.38%和66.55%。拟议的NOC架构的平均数据包延迟比Connect和Dart架构更小24.8%和19.1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号