【24h】

Hoplite: Building austere overlay NoCs for FPGAs

机译:Hoplite:为FPGA构建严格的覆盖NoC

获取原文

摘要

Customized unidirectional, bufferless, deflection-routed torus networks can outperform classic, bidirectional, buffered mesh networks for single-flit-oriented FPGA applications by as much as 1.5× (best achievable throughputs for a 10×10 system) or 2.5× (allocating same FPGA resources to both NoCs) for uniform random traffic. We present Hoplite, an efficient, lightweight, fast FPGA overlay NoC that is designed to be small and compact by (1) eliminating input buffers, and (2) reducing the cost of switch crossbar that have traditionally limited speeds and imposed heavy resource costs in conventional FPGA overlay NoCs. We implement bufferless deflection routing cheaply, requiring the generation of only output multiplexer controls and no backpressure handshakes. Additionally, we use directional channels that help reduce crossbar cost by restricting the number of inputs to the crossbar to three instead of four. When compared to buffered mesh switches, FPGA-based deflection routers are ≈3.5× smaller (HLS-generated switch) and 2.5× faster (clock period) for 32b payloads. In a separate experiment, we hand-crafted a prototype RTL version of our switch with RLOCS that requires only 60 LUTs and 100 FFs per router and runs at 2.9 ns.
机译:定制的单向,无缓冲,偏转路径环形网络可以比传统的双向,缓冲网状网络(面向单平面FPGA应用)高1.5倍(10×10系统的最佳吞吐量)或2.5倍(分配相同)。两个NoC的FPGA资源),以实现统一的随机流量。我们介绍了Hoplite,这是一种高效,轻便,快速的FPGA覆盖NoC,通过(1)消除输入缓冲器,以及(2)降低了传统上速度受限并增加资源成本的开关交叉开关的成本,将其设计为小型紧凑。常规的FPGA覆盖NoC。我们便宜地实现了无缓冲偏转路由,只需要生成输出多路复用器控件,而无需背压握手。此外,我们使用定向通道,通过将交叉开关的输入数量限制为三个而不是四个,从而帮助降低了交叉开关的成本。与带缓冲的网状交换机相比,对于32b有效载荷,基于FPGA的偏转路由器小约3.5倍(HLS生成的交换机),而快2.5倍(时钟周期)。在一个单独的实验中,我们使用RLOCS手工制作了RTLCS交换机的原型RTL版本,每个路由器仅需要60个LUT和100 FF,并且运行时间为2.9 ns。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号