Designing an efficient parallel spectral clustering algorithm on multi-core processors in Julia

Zenan Huo; Gang Mei; Giampaolo Casolla; Fabio Giampaolo

Abstract

Spectral clustering is widely used in data mining, machine learning, and other fields. It can identify sample spaces of arbitrary shape and converge to the globally optimal solution. Compared with the traditional k-means algorithm, spectral clustering adapts better to the data and yields better clustering results. However, it is computationally expensive. This paper proposes juPSC, an efficient parallel spectral clustering algorithm for multi-core processors, implemented in Julia, a high-performance, open-source programming language. juPSC consists of three procedures: (1) calculating the affinity matrix, (2) calculating the eigenvectors, and (3) conducting k-means clustering. Procedures (1) and (3) are computed in parallel, and the COO (coordinate) format is used to compress the affinity matrix. Two groups of experiments are conducted to verify the accuracy and efficiency of juPSC. The experimental results indicate that (1) juPSC achieves speedups of approximately 14× to 18× on a 24-core CPU and (2) the serial version of juPSC is faster than the Python implementation in scikit-learn. Moreover, the structure and functions of juPSC are designed with modularity in mind, which makes it convenient to combine with, and further optimize on, other parallel computing platforms.
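
As a rough illustration of procedure (1) and the COO compression mentioned in the abstract, the sketch below builds a thresholded Gaussian affinity matrix and stores only its non-negligible entries as (row, column, value) triplets. It is written in plain Python rather than the paper's Julia, it is serial, and it is not the juPSC implementation: the Gaussian kernel, the parameters `sigma` and `threshold`, and the function name `gaussian_affinity_coo` are assumptions made for this example only.

```python
import math


def gaussian_affinity_coo(points, sigma=1.0, threshold=1e-3):
    """Build a sparse affinity matrix as COO triplets (rows, cols, vals).

    Assumed kernel: w_ij = exp(-||x_i - x_j||^2 / (2 * sigma^2)).
    Entries below `threshold` and the diagonal are not stored.
    """
    rows, cols, vals = [], [], []
    n = len(points)
    for i in range(n):
        for j in range(n):
            if i == j:
                continue  # no self-affinity in this sketch
            d2 = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
            w = math.exp(-d2 / (2.0 * sigma ** 2))
            if w >= threshold:
                rows.append(i)
                cols.append(j)
                vals.append(w)
    return rows, cols, vals, n


def coo_degrees(rows, vals, n):
    """Row sums of the COO matrix, i.e. the degree of each sample."""
    deg = [0.0] * n
    for i, w in zip(rows, vals):
        deg[i] += w
    return deg


# Two well-separated pairs of points: only within-pair affinities survive.
points = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]
rows, cols, vals, n = gaussian_affinity_coo(points)
degrees = coo_degrees(rows, vals, n)
```

For these four points, the cross-pair affinities fall far below the threshold, so the COO arrays hold 4 entries instead of the 16 of a dense matrix. Each row of the outer loop is independent of the others, which is what makes this step a natural candidate for the multi-core parallelism the abstract describes.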

Bibliographic details

Source: Journal of Parallel and Distributed Computing, 2020, Issue 4, pp. 211-221 (11 pages)
Authors: Zenan Huo; Gang Mei; Giampaolo Casolla; Fabio Giampaolo
Author affiliations:

School of Engineering and Technology, China University of Geosciences (Beijing), 100083 Beijing, China;

School of Engineering and Technology, China University of Geosciences (Beijing), 100083 Beijing, China;

Department of Mathematics and Applications 'R. Caccioppoli', University of Naples Federico II, Italy;

Consorzio Interuniversitario Nazionale per l'Informatica (CINI), Italy

Original format: PDF
Language: English
Keywords:
Clustering algorithm; Spectral clustering; Parallel algorithm; Multi-core processors; Julia language
