首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium >FastBFS: Fast Breadth-First Graph Search on a Single Server
【24h】

FastBFS: Fast Breadth-First Graph Search on a Single Server

机译:FastBFS:在单个服务器上的快速宽度图形搜索

获取原文

摘要

Big graph computing can be performed over a single node, using recent systems such as GraphChi and XStream. Breadth-first graph search (a.k.a., BFS) has a pattern of marking each vertex only once as “visited” and then not using them in further computations. Existing single-server graph computing systems fail to take advantage of such access pattern of BFS for performance optimization, hence suffering from a lot of extra I/O latencies due to accessing no longer useful data elements of a big graph as well as wasting plenty of computing resources for processing them. In this paper, we propose FastBFS, a new approach that accelerates breadth-first graph search on a single server by leverage of the preceding access pattern during the BFS iterations over a big graph. First, FastBFS uses an edgecentric graph processing model to obtain the high bandwidth of sequential disk accesses without expensive data preprocessing. Second, with a new asynchronous trimming mechanism, FastBFS can efficiently reduce the size of a big graph by eliminating useless edges in parallel with the computation. Third, FastBFS schedules I/O streams efficiently and can attain greater parallelism if an additional disk is available. We implement FastBFS by modifying the X-Stream system developed by EPFL. Our experimental results show that FastBFS can outperform X-stream and GraphChi in the computing speed by up to 2.1 and 3.9 times faster respectively. With an additional disk, FastBFS can even outperform them by up to 3.6 and 6.5 times faster respectively.
机译:可以在单个节点上执行大图计算,使用最近的系统(如Graphichi和XStream)进行。宽度第一图形搜索(A.K.A.,BFS)具有标记每个顶点的模式,只需一次“访问”,然后在进一步计算中不使用它们。现有的单服务器图形计算系统无法利用BFS的此类访问模式以进行性能优化,因此由于访问的访问不再是大图的有用数据元素以及浪费大量而导致的大量额外的I / O潜伏计算资源以处理它们。在本文中,我们提出了一种新方法,通过在BFS迭代期间利用前面的访问模式,在一个大图中利用前面的访问模式加速了单个服务器的广度第一图形搜索。首先,FastBFS使用EDGECENTRIC图形处理模型来获得连续磁盘访问的高带宽,而无需昂贵的数据预处理。其次,通过新的异步修剪机制,FastBF可以通过消除与计算并行的无用边缘有效地减小大图的大小。第三,FastBFS有效地安排I / O流,如果可用额外的磁盘,则可以获得更大的并行性。我们通过修改EPFL开发的X流系统来实现FastBFS。我们的实验结果表明,FastBFS可以在计算速度下优于X型流和Graphichi,分别更快地更快2.1和3.9倍。使用额外的磁盘,FastBFS甚至可以快速地优于3.6和6.5倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号