首页> 外文期刊>PLoS One >FlexGraph: Flexible partitioning and storage for scalable graph mining
【24h】

FlexGraph: Flexible partitioning and storage for scalable graph mining

机译:FlexGraph:可伸缩图挖掘的灵活分区和存储

获取原文
           

摘要

How can we analyze large graphs such as the Web, and social networks with hundreds of billions of vertices and edges? Although many graph mining systems have been proposed to perform various graph mining algorithms on such large graphs, they have difficulties in processing Web-scale graphs due to massive communication and I/O costs caused by communication between workers, and reading subgraphs repeatedly. In this paper, we propose FlexGraph, a scalable distributed graph mining method reducing the costs by exploiting properties of real-world graphs. FlexGraph significantly decreases the communication cost, which is the main bottleneck of distributed systems, by exploiting different edge placement policies based on types of vertices. Furthermore, we propose a flexible storage format to reduce I/O costs when reading input graph repeatedly. Experiments show that FlexGraph succeeds in processing up to 64× larger graphs than existing distributed memory-based graph mining methods, and consistently outperforms previous disk-based graph mining methods.
机译:我们如何分析诸如Web等大型图形和数百千米次顶点和边的社交网络?尽管已经提出了许多图形挖掘系统在这种大图上执行各种图形挖掘算法,但由于工人之间的沟通和读取子图反复读取子图,因此在处理Web刻度图方面具有困难。在本文中,我们提出了FlexGraph,一种可扩展的分布式图形挖掘方法,通过利用现实世界图形的性质来降低成本。 FlexGraph通过基于顶点类型利用不同的边缘放置策略,显着降低了分布式系统的主要瓶颈的通信成本。此外,我们提出了一种柔性存储格式,以重复读取输入图时降低I / O成本。实验表明,FlexGraph成功地处理高达64倍的图形,而不是现有的基于内存的图形挖掘方法,并且始终如一地优于以前的基于磁盘的图形挖掘方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号