Efficient Parallel Subgraph Counting Using G-Tries

机译：使用G-Tries的高效并行子图计数

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Finding and counting the occurrences of a collection of subgraphs within another larger network is a computationally hard problem, closely related to graph isomorphism. The subgraph count is by itself a very powerful characterization of a network and it is crucial for other important network measurements. G-tries are a specialized data-structure designed to store and search for subgraphs. By taking advantage of subgraph common substructure, g-tries can provide considerable speedups over previously used methods. In this paper we present a parallel algorithm based precisely on g-tries that is able to efficiently find and count subgraphs. The algorithm relies on randomized receiver-initiated dynamic load balancing and is able to stop its computation at any given time, efficiently store its search position, divide what is left to compute in two halfs, and resume from where it left. We apply our algorithm to several representative real complex networks from various domains and examine its scalability. We obtain an almost linear speedup up to 128 processors, thus allowing us to reach previously unfeasible limits. We showcase the multidisciplinary potential of the algorithm by also applying it to network motif discovery.

机译：在另一个较大的网络中查找和计数子图集合的出现是一个计算难题，与图同构密切相关。子图计数本身是网络的非常强大的表征，对于其他重要的网络测量至关重要。 G-tries是专门用于存储和搜索子图的专用数据结构。通过利用子图的通用子结构，与以前使用的方法相比，g-tries可以提供可观的加速。在本文中，我们提出了一种基于g-tries的并行算法，该算法能够有效地查找和计数子图。该算法依靠随机的接收器启动的动态负载平衡，并且能够在任何给定时间停止其计算，有效地存储其搜索位置，将剩下的要计算的内容分成两半，然后从剩下的位置恢复。我们将算法应用于来自各个领域的几个代表性的实际复杂网络，并研究其可扩展性。我们获得了多达128个处理器的几乎线性加速，从而使我们能够达到以前无法实现的极限。通过将其应用于网络主题发现，我们展示了该算法的多学科潜力。

著录项

来源
《2010 IEEE International Conference on Cluster Computing》|2010年|p.217-226|共10页
会议地点
作者
Ribeiro Pedro; Silva Fernando; Lopes Luis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类分子生物学;
关键词
Adaptive Load Balancing; Complex Networks; G-Tries; Graph Mining; Parallel Algorithms;

机译：自适应负载平衡;复杂网络; G-Tries;图挖掘;并行算法;

相似文献

外文文献
中文文献
专利

1. G-Tries: a data structure for storing and finding subgraphs [J] . Pedro Ribeiro, Fernando Silva Data mining and knowledge discovery . 2014,第2期

机译：G-Tries：用于存储和查找子图的数据结构
2. An efficiently computable subgraph pattern support measure: counting independent observations [J] . Yuyi Wang, Jan Ramon, Thomas Fannes Data Mining and Knowledge Discovery . 2013,第3期

机译：一种可有效计算的子图模式支持措施：计算独立观察值
3. An efficiently computable subgraph pattern support measure: Counting independent observations [J] . Wang Y., Ramon J., Fannes T. Data mining and knowledge discovery . 2013,第3期

机译：有效计算子图模式的支持措施：计算独立观察值
4. Efficient Parallel Subgraph Counting Using G-Tries [C] . Ribeiro Pedro, Silva Fernando, Lopes Luis 2010 IEEE International Conference on Cluster Computing . 2010

机译：使用G-Tries的高效并行子图计数
5. Upper Tails of Subgraph Counts in Sparse Regular Graphs [D] . Gunby, Benjamin. 2021

机译：子图的上部尾部计数稀疏常规图表
6. A fast lock-free approach for efficient parallel counting of occurrences of k-mers [O] . Guillaume Marçais, Carl Kingsford -1

机译：快速无锁的方法可有效地并行计算k-mers的出现
7. Efficiently Counting Vertex Orbits of All 5-vertex Subgraphs, by EVOKE [O] . Noujan Pashanasangi, C. Seshadhri 2020

机译：通过唤起有效地计算所有5个顶点子图的顶点轨道

Efficient Parallel Subgraph Counting Using G-Tries

摘要

著录项

相似文献

相关主题

期刊订阅