首页> 外文会议>IEEE International Congress on Big Data >Connecting the dots: Triangle completion and related problems on large data sets using GPUs
【24h】

Connecting the dots: Triangle completion and related problems on large data sets using GPUs

机译:连接点:使用GPU在大型数据集上的三角形补全和相关问题

获取原文

摘要

Studying the properties of Online Social Networks (OSNs) and other real world graphs have gained importance due to the large amount of information available from them. These large graphs contain data that can be analyzed and effectively used in advertising, security and improving the overall experience of the users of these networks. However, the analysis of these graphs for studying specific properties requires combinatorially explosive number of computations. Compute Unified Device Architecture (CUDA) is a programming model available from Nvidia for solving general-purpose problems using the massively parallel and highly multi-threaded Graphics Processing Units (GPUs). Therefore, using GPUs to solve these types of problems is appropriate. In addition, due to the properties of real-world data, the graphs being considered are sparse and have irregular data dependencies. Hence, using efficient techniques to store the graph data for initial preprocessing and final computation by taking advantage of heterogeneous CPU-GPU systems can address these issues. In this paper, we are interested in studying different properties of these real-world entities that transform into the following graph problems: a) identifying a missing edge, which when added would result in maximum increase in the number of triangles, b) identifying an existing edge whose removal would result in the maximum decrease in the number of triangles, c) identifying an existing edge whose removal would increase the number of connected components in the graph. In this paper, we develop and implement algorithms to solve the above problems using both CPU and GPU. Specifically, given a graph G = (V, E), we provide algorithms for the following: a) find (v, V) ∉ E, such that Δ - Δ is maximized, where Δ and Δ are the number of triangles in G = (V, E ∪(v, V)) an- G, respectively, b) find a (v, V) ϵ E, such that Δ - Δ is maximized, where Δ and Δ are the number of triangles in G = (V, E (v, V)) and G = (V, E), respectively, c) find a (v, V) ϵ E, such that Φ > Φ, where Φ and Φ are the number of connected components in G = (V, E (v, V)) and G = (V, E), respectively. We implement the algorithms using a GPU and achieve a 10 × speedup as compared to a sequential implementation. Thereafter, we design a heuristic for finding an edge whose existence would result in the maximum increase in the number of triangles. The heuristic is implemented and the results are reported and compared to those of the regular algorithm on the GPU.
机译:由于对在线社交网络(OSN)和其他现实世界图的可用信息量很大,因此研究它们的重要性变得越来越重要。这些大图包含可以分析的数据,可以有效地用于广告,安全性以及改善这些网络用户的整体体验。但是,为研究特定特性而对这些图进行分析需要组合爆炸性的计算量。计算统一设备体系结构(CUDA)是Nvidia提供的一种编程模型,用于使用大规模并行和高度多线程的图形处理单元(GPU)解决通用问题。因此,使用GPU解决这些类型的问题是合适的。此外,由于实际数据的属性,所考虑的图形稀疏且具有不规则的数据依存关系。因此,通过利用异构CPU-GPU系统的优势,使用有效的技术来存储图形数据以进行初始预处理和最终计算可以解决这些问题。在本文中,我们有兴趣研究这些现实世界实体的不同属性,这些属性转化为以下图形问题:a)识别缺失的边,添加后将导致三角形数量的最大增加,b)识别现有边缘,将其删除将导致三角形数量最大减少; c)标识现有边缘,将其删除将增加图形中连接组件的数量。在本文中,我们开发并实现了使用CPU和GPU来解决上述问题的算法。具体来说,给定图G =(V,E),我们提供以下算法:a)找到(v,V)∉E,使Δ-Δ最大化,其中Δ和Δ是G中的三角形数量=(V,E∪(v,V))an- G,b)找出一个(v,V)ϵ E,使得Δ-Δ最大化,其中Δ和Δ是G = (V,E(v,V))和G =(V,E),c)找出一个(v,V)ϵ E,使得Φ>Φ,其中Φ和Φ是G =(V,E(v,V))和G =(V,E)。与顺序实现相比,我们使用GPU来实现算法,并实现了10倍的加速。此后,我们设计了一种启发式算法,以找到一条边缘,该边缘的存在将导致三角形数量的最大增加。实现了启发式方法,并报告了结果并将其与GPU上常规算法的结果进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号