基于CombBLAS的同辈压力图聚类并行算法的设计与实现

邹佩钢; 陈军

首页> 中文期刊> 《计算机工程与科学》 >基于CombBLAS的同辈压力图聚类并行算法的设计与实现

基于CombBLAS的同辈压力图聚类并行算法的设计与实现

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

图聚类是指把图中相对连接紧密的顶点及其相关的边分组形成一个子图的过程,在包括机器学习、数据挖掘、模式识别、图像分析及生物信息等领域有着广泛应用.但是,随着大数据时代的到来,图数据海量增长.面对广泛的大规模图计算需求,由于图结构本身的不规则性,单机算法运行效率低下,用传统的并行计算方法进行图计算难以获得高性能.使用线性代数的方法在Combinatorial BLAS上实现了同辈压力(Peer Pressure)图聚类的分布式算法,首先将该图聚类的算法转换为对稀疏矩阵的运算,从而结构化表示图的不规则数据结构及接入模式,然后基于MPI编程模型将其并行实现.实验结果表明,在并行处理规模达到43亿的由稀疏矩阵表示的超大规模图时,基于线性代数表示的同辈压力图聚类算法在曙光超级计算机上取得了较高的并行性能及良好的可扩展性,在64个核上获得了40.1的并行加速.%Graph clustering is a problem of determining natural groups with high connectivity in a graph.This can be useful in fields such as machine learning,data mining,pattern recognition,image analysis and bioinformatics.To meet the graph-theoretic analysis demands of emerging"big data" applications,it is essential to speed up the underlying graph problems of current parallel systems.However,it is difficult to parallelize large-scale graph computation and achieve good performance using traditional approaches due to their irregular graph structure and low operation intensity.We implement a scalable distributed-memory algorithm for peer pressure graph clustering using the sparse matrix infrastructure in Combinatorial BLAS.We first convert the peer pressure graph clustering algorithm to sparse matrix computation,which allows irregular data structures and access patterns in parallel applications to be represented and can efficiently address the graph parallel challenge.Finally,the proposed algorithm is parallelized based on the MPI programming model.Experiments show that when the scale of the graph represented by a sparse matrix is up to 4.3 billion,the parallel peer pressure clustering algorithm based on linear algebraic has high performance and is well scalable on the Dawning Supercomputer,and the speedup can be up to 40.1x when the number of core scales to 64.

著录项

来源
《计算机工程与科学》 |2017年第3期|424-429|共6页
作者
邹佩钢; 陈军;
展开▼
作者单位

北京应用物理与计算数学研究所;

北京100088;

中国工程物理研究院研究生院;

北京100088;

北京应用物理与计算数学研究所;

北京100088;

展开▼
原文格式 PDF
正文语种 chi
中图分类信息处理（信息加工）;
关键词
图计算; 同辈压力聚类; 并行; Combinatorial BLAS; 稀疏矩阵; 大规模图; MPI;

相似文献

中文文献
外文文献
专利

1. 基于并行算法的快速人脸识别系统设计与实现 [J] . 许嘉诚 . 无线互联科技 . 2020,第006期
2. 基于并行算法的图像秘密共享方案的设计与实现 [J] . 侯颖 . 信息技术与信息化 . 2020,第006期
3. 一种基于GPU集群的深度优先并行算法设计与实现 [J] . 余莹 ,李肯立 ,郑光勇 . 计算机科学 . 2015,第001期
4. 基于CUDA的图像分割并行算法设计与实现 [J] . 侯广峰 ,王媛媛 ,郭禾 . 数字技术与应用 . 2013,第003期
5. 基于CUDA的图像分割并行算法设计与实现 [J] . 侯广峰 ,王媛媛 ,郭禾 . 数字技术与应用 . 2013,第003期
6. 基于线性代数的同辈压力图聚类并行算法优化 [C] . Zou Peigang ,邹佩钢 ,Chen Jun . 2016年全国高性能计算学术年会 . 2016
7. 大学生同辈压力量表的修订及其同辈压力现况调查研究 [A] . 王静贤 . 2018

基于CombBLAS的同辈压力图聚类并行算法的设计与实现

摘要

著录项

相似文献

相关主题

期刊订阅