Accelerating Graph Analytics on CPU-FPGA Heterogeneous Platform

机译：CPU-FPGA异构平台加速图分析

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Hardware accelerators for graph analytics have gained increasing interest. Vertex-centric and edge-centric paradigms are widely used to design graph analytics accelerators. However, both of them have notable drawbacks: vertex-centric paradigm requires random memory accesses to traverse edges and edge-centric paradigm results in redundant edge traversals. In this paper, we explore the tradeoffs between vertex-centric and edge-centric paradigms and propose a hybrid algorithm which dynamically selects between them during the execution. We introduce the notion of active vertex ratio, based on which we develop a simple but efficient paradigm selection approach. We develop a hybrid data structure to concurrently support vertex-centric and edge-centric paradigms. Based on the hybrid data structure, we propose a graph partitioning scheme to increase parallelism and enable efficient parallel computation on heterogeneous platforms. In each iteration, we use our paradigm selection approach to select the appropriate paradigm for each partition. Further, we map our hybrid algorithm onto a stateof-the-art heterogeneous platform which integrates a multi-core CPU and a Field-Programmable Gate Array (FPGA) in a cache coherent fashion. We use our design methodology to accelerate two fundamental graph algorithms, breadth-first search (BFS) and single-source shortest path (SSSP). Experimental results show that our CPU-FPGA co-processing achieves up to 1.5× (1.9×) speedup for BFS (SSSP) compared with optimized baseline designs. Compared with the state-of-the-art FPGA-based designs, our design achieves up to 4.0× (4.2×) throughput improvement for BFS (SSSP). Compared with a state-of-the-art multi-core design, our design demonstrates up to 1.5× (1.8×) speedup for BFS (SSSP).

机译：图形分析的硬件加速器已获得越来越令人利益。以顶点为中心和边缘的范式广泛用于设计图形分析加速器。然而，两个都有显着的缺点：以顶点为中心的范例需要随机内存访问到遍历边缘和以边缘为中心的范例导致冗余边缘遍历。在本文中，我们探讨了顶视为中心和以边缘的范例之间的权衡，并提出了一种混合算法，其在执行期间动态地选择它们。我们介绍了主动顶点比的概念，基于我们开发了一种简单但有效的范式选择方法。我们开发混合数据结构以同时支持以顶点为中心和边缘的范例。基于混合数据结构，我们提出了一种图形分区方案来增加并行性，并在异构平台上实现有效的并行计算。在每次迭代中，我们使用我们的范式选择方法为每个分区选择适当的范例。此外，我们将混合算法映射到现有技术的异构平台上，该平台集成了多核CPU和现场可编程门阵列（FPGA）以高速缓存的相干方式。我们使用我们的设计方法来加速两个基本的图形算法，广度第一搜索（BFS）和单源最短路径（SSSP）。实验结果表明，与优化基线设计相比，我们的CPU-FPGA协同加工可实现BFS（SSP）的1.5倍（1.9×）加速。与最先进的FPGA设计相比，我们的设计可实现BFS（SSP）的4.0倍（4.2倍）的吞吐量改进。与最先进的多核设计相比，我们的设计展示了BFS（SSP）的1.5倍（1.8×）加速。

著录项

来源
《International Symposium on Computer Architecture and High Performance Computing》|2017年|186p|共8页
会议地点
作者
Shijie Zhou; Viktor K. Prasanna;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Field programmable gate arrays; Arrays; Acceleration; Central Processing Unit; Partitioning algorithms; Algorithm design and analysis;

机译：现场可编程门阵列;阵列;加速;中央处理单元;分区算法;算法设计和分析;

相似文献

外文文献
中文文献
专利

1. In-Depth Analysis on Microarchitectures of Modern Heterogeneous CPU-FPGA Platforms [J] . Choi Young-Kyu, Cong Jason, Fang Zhenman, ACM transactions on reconfigurable technology and systems . 2019,第1期

机译：现代异构CPU-FPGA平台的微体系结构的深入分析
2. Accelerating Gossip-Based Deep Learning in Heterogeneous Edge Computing Platforms [J] . Han Rui, Li Shilin, Wang Xiangwei, IEEE Transactions on Parallel and Distributed Systems . 2021,第7期

机译：在异构边缘计算平台中加速基于八卦的深度学习
3. Accelerated LiDAR data processing algorithm for self-driving cars on the heterogeneous computing platform [J] . Li Wei, Liang Jun, Zhang Yunquan, Computers & Digital Techniques, IET . 2020,第5期

机译：加速LIDAR数据处理算法在异构计算平台上的自动驾驶汽车
4. Accelerating Graph Analytics on CPU-FPGA Heterogeneous Platform [C] . Shijie Zhou, Viktor K. Prasanna International symposium on computer architecture and high performance computing . 2017

机译：在CPU-FPGA异构平台上加速图形分析
5. Efficient and Scalable Parallel Stochastic Gradient Descent on a Heterogeneous CPU-FPGA platform for Large Scale Machine Learning [D] . Rasoori, Sandeep. 2017

机译：用于大规模机器学习的异构CPU-FPGA平台上高效且可伸缩的平行随机梯度下降
6. Semalytics: a semantic analytics platform for the exploration of distributed and heterogeneous cancer data in translational research [O] . Andrea Mignone, Alberto Grand, Alessandro Fiori, 2019

机译：Semalytics：一个语义分析平台用于在翻译研究中探索分布式和异构癌症数据
7. Accelerated Simulation of Cell Biological Systems Using Heterogeneous Parallel Processing Platforms- A Survey [O] . K. Abhishek, N. Sreenivasa, S Balaji 2018

机译：使用异构平行处理平台的细胞生物系统的加速模拟 - 调查

Accelerating Graph Analytics on CPU-FPGA Heterogeneous Platform

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅