首页> 外文会议>International Conference on Big Data Computing and Communications >SGraph: A Distributed Streaming System for Processing Big Graphs
【24h】

SGraph: A Distributed Streaming System for Processing Big Graphs

机译:SGraph:用于处理大图的分布式流系统

获取原文

摘要

Big graph processing has been widely used in various computational domains, ranging from language modeling to social networks. Graph-parallel systems have been proposed to process such big graphs on clusters with up to hundreds of nodes. However, the size of a big graph often exceeds the available main memories in a small cluster. As a consequence, task failures happen frequently. To address this problem, we propose SGraph, a distributed streaming graph processing system built on top of Spark. SGraph introduces a streaming data model to avoid loading all of the graph data which may exceed the available RAM space. In addition, SGraph leverages an edge-centric scatter-gather computing model that can be used to conveniently implement graph algorithms. Experiments demonstrate that SGraph can process graphs with up to 1.5 billion edges on small clusters with several low-cost commodity PCs, whereas existing systems may require up to tens or hundreds of high-end machines. Furthermore, SGraph is up to 2.3 times faster than existing systems.
机译:大图处理已广泛用于各种计算领域,从语言建模到社交网络。已经提出了图并行系统来处理具有多达数百个节点的群集上的大图。但是,大图的大小通常会超出小集群中可用的主内存。结果,任务失败频繁发生。为了解决这个问题,我们提出了SGraph,这是一个基于Spark的分布式流图处理系统。 SGraph引入了流数据模型,以避免加载可能超出可用RAM空间的所有图数据。此外,SGraph利用以边缘为中心的分散聚集计算模型,该模型可用于方便地实现图形算法。实验表明,SGraph可以在具有几台低成本商用PC的小型集群上处理多达15亿条边的图形,而现有系统可能需要多达数十或数百台高端计算机。此外,SGraph的速度比现有系统快2.3倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号