首页> 外文会议>IEEE International Congress on Big Data >Enumerating Maximal Bicliques from a Large Graph Using MapReduce
【24h】

Enumerating Maximal Bicliques from a Large Graph Using MapReduce

机译:使用MapReduce从大型图中枚举最大Bicliques

获取原文

摘要

We consider the enumeration of maximal bipartite cliques (bicliques) from a large graph, a task central to many practical data mining problems in social network analysis and bioinformatics. We present novel parallel algorithms for the MapReduce platform, and an experimental evaluation using Hadoop MapReduce. Our algorithm is based on clustering the input graph into smaller sized subgraphs, followed by processing different subgraphs in parallel. Our algorithm uses two ideas that enable it to scale to large graphs: (1) the redundancy in work between different subgraph explorations is minimized through a careful pruning of the search space, and (2) the load on different reducers is balanced through the use of an appropriate total order among the vertices. Our evaluation shows that the algorithm scales to large graphs with millions of edges and tens of millions of maximal bicliques. To our knowledge, this is the first work on maximal biclique enumeration for graphs of this scale.
机译:我们考虑从大图中的最大二分派系(Bicliques)的枚举,这是社会网络分析和生物信息学中许多实际数据挖掘问题的任务中心。我们为MapReduce平台提出了新的并行算法,以及使用Hadoop MapReduce进行实验评估。我们的算法基于将输入图聚集成更小的尺寸子图,然后并行处理不同的子图。我们的算法使用了两个想法,使其能够扩展到大图:(1)通过仔细修剪搜索空间的仔细修剪,不同子图探索之间的工作中的冗余,(2)不同减速器的负载通过使用平衡在顶点之间的适当总令。我们的评估表明,该算法缩小到具有数百万边缘的大图和数千万的最大双层。据我们所知,这是该规模图形最大Biclique枚举的第一个工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号