【24h】

High-Throughput and Energy-Efficient Graph Processing on FPGA

机译:FPGA上的高通量和节能图形处理

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose a novel design for large-scale graph processing on FPGA. Our design uses large external memory for storing massive graph data and FPGA for acceleration, and leverages edge-centric computing principles. We propose a data layout which optimizes the external memory performance and leads to an efficient memory activation schedule to reduce on-chip memory power consumption. Further, we develop a parallel architecture on FPGA which can saturate the external memory bandwidth and concurrently process multiple input data to increase throughput. We use our design to accelerate several classic graph algorithms, including single-source shortest path, weakly connected component, and minimum spanning tree. Experimental results show that for all the considered graph algorithms, our design achieves high throughput of over 600 million traversed edges per second (MTEPS) and high energy-efficiency of over 30 MTEPS/W. Compared with a baseline design, our optimizations result in over 3.6× throughput and 5.8× energy-efficiency improvements, respectively. Our design achieves 32% throughput improvement when compared with state-of-the-art FPGA designs, and up to 7.8× speedup when compared with state-of-the-art multi-core implementation.
机译:在本文中,我们提出了一种在FPGA上进行大规模图形处理的新颖设计。我们的设计使用大型外部存储器存储大量图形数据,并使用FPGA进行加速,并利用以边缘为中心的计算原理。我们提出了一种数据布局,该布局可优化外部存储器性能并导致有效的存储器激活计划,从而降低片上存储器的功耗。此外,我们在FPGA上开发了并行架构,该架构可以饱和外部存储器带宽并同时​​处理多个输入数据以提高吞吐量。我们使用我们的设计来加速几种经典的图形算法,包括单源最短路径,弱连接组件和最小生成树。实验结果表明,对于所有考虑的图形算法,我们的设计均实现了每秒超过6亿个横向边(MTEPS)的高吞吐量,以及超过30 MTEPS / W的高能效。与基准设计相比,我们的优化分别使吞吐量提高了3.6倍,能源效率提高了5.8倍。与最新的FPGA设计相比,我们的设计将吞吐量提高了32%,与最新的多核实现相比,其速度提高了7.8倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号