...
首页> 外文期刊>Computer science >Understanding parallelism in graph traversal on multi-core clusters
【24h】

Understanding parallelism in graph traversal on multi-core clusters

机译:了解多核集群上图遍历的并行性

获取原文
获取原文并翻译 | 示例

摘要

There is an ever-increasing need for exploring large-scale graph data sets in computational sciences, social networks, and business analytics. However, due to irregular and memory-intensive nature, graph applications are notoriously known for their poor performance on parallel computer systems. In this paper we propose a new hybrid MPI/Pthreads breadth-first search (BFS) algorithm featuring with (ⅰ) overlapping computation and communication by separating them into multiple threads, (ⅱ) maximizing multi-threading parallelism on multi-cores with massive threads to improve throughputs, and (ⅲ) exploiting pipeline parallelism using lock-free queues for asynchronous communication. By comparing it with traditional MPI-only BFS algorithm, we learned several valuable lessons that would help to understand and exploit parallelism in graph traversal applications. Experiments show our algorithm is 1.9 x faster than the MPI-only version, capable of processing 1.45 billion edges per second on a 32-node SMP cluster. At a large scale, our algorithm is 1.49x than the MPI-only BFS algorithm in Combinatorial BLAS Library with 6,144 cores.
机译:越来越需要在计算科学,社交网络和业务分析中探索大型图形数据集。但是,由于不规则和占用大量内存的性质,众所周知,图形应用程序在并行计算机系统上的性能较差。在本文中,我们提出了一种新的MPI / Pthreads广度优先混合搜索算法,该算法具有(featuring)将计算和通信分离为多个线程的重叠和(communication)最大化具有大线程的多核上的多线程并行性以提高吞吐量,以及(ⅲ)使用无锁队列进行异步通信来利用管道并行性。通过将其与仅使用MPI的传统BFS算法进行比较,我们学到了一些宝贵的经验教训,有助于理解和利用图遍历应用程序中的并行性。实验表明,我们的算法比仅MPI版本快1.9倍,能够在32节点SMP集群上每秒处理14.5亿条边。在大规模情况下,我们的算法比具有6144个核的组合BLAS库中仅MPI的BFS算法高1.49倍。

著录项

  • 来源
    《Computer science》 |2013年第3期|193-201|共9页
  • 作者单位

    State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China,Graduate School of Chinese Academy of Sciences, Beijing, China;

    State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;

    State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;

    State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Breadth-first search; Graph algorithms; Hybrid MPI/Pthreads programming; Lock-free queues;

    机译:广度优先搜索;图算法;混合MPI / Pthreads编程;无锁队列;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号