基于 Hadoop MapReduce并行近似谱聚类算法研究与实现

杨煜; 赵成贵

首页> 中文期刊> 《计算机应用与软件》 >基于 Hadoop MapReduce并行近似谱聚类算法研究与实现

基于 Hadoop MapReduce并行近似谱聚类算法研究与实现

AI论文写作 >>

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

With the advent of information age, the large-scale high-dimensional data generated in Internet increases exponentially, its spectral clustering suffers from the bottleneck problem in both computational time and memory use, particularly in solving Laplacian matrix eigenvector decomposition.Given the advantages of Hadoop MapReduce parallel programming model in processing intensive data, based on t nearest neighbour sparse approximation similarity Laplacian matrix, in this paper we design Hadoop MapReduce parallel approximate spectral clustering algorithm to solve the above-mentioned bottleneck problem.The experiment uses UCI Bag of Words dataset to validate the correctness and effectiveness of the designed algorithm, result indicates that the parallel design aligns with a certain desired effect in terms of spectral clustering quality and performance.%随着信息时代的来临，互联网产生的大规模高维数据呈现几何级数增长，对其进行谱聚类在计算时间和内存使用上都存在瓶颈问题，尤其是求Laplacian矩阵特征向量分解。鉴于Hadoop MapReduce并行编程模型对密集型数据处理的优势，基于t最近邻稀疏化近似相似Laplacian矩阵，设计Hadoop MapReduce并行近似谱聚类算法，以期解决上述瓶颈问题。实验使用UCI Bag of Words数据集验证所设计算法的正确性和有效性，结果显示该并行设计在谱聚类质量和性能方面达到了一定的预期效果。

著录项

来源
《计算机应用与软件》 |2015年第8期|17-21,63|共6页
作者
杨煜; 赵成贵;
展开▼
作者单位

云南财经大学信息学院云南昆明650221;

曲靖市公安局经济技术开发区分局云南曲靖 655000;

云南财经大学信息学院云南昆明650221;

展开▼
原文格式 PDF
正文语种 chi
中图分类算法理论;
关键词
Hadoop分布式系统; MapReduce并行计算; 近似谱聚类算法; 稀疏近似相似矩阵; 大规模高维数据;

相似文献

中文文献
外文文献
专利

1. 基于Hadoop云平台的并行谱聚类算法的设计与实现 [J] . 牛科 ,贾郭军 . 山西师范大学学报（自然科学版） . 2014,第001期
2. 基于Hadoop MapReduce和粗粒度并行遗传算法的大数据聚类方法改进 [J] . 郭晨晨 ,朱红康 . 黑龙江大学工程学报 . 2016,第003期
3. 基于Hadoop MapReduce和粗粒度并行遗传算法的大数据聚类方法改进 [J] . 郭晨晨 ,朱红康 . 黑龙江大学工程学报 . 2016,第003期
4. 一种基于MPI的稀疏化局部尺度并行谱聚类算法的研究与实现 [J] . 李瑞琳 ,赵永华 ,黄小磊 . 计算机工程与科学 . 2016,第005期
5. 基于MapReduce的并行蚁群算法研究与实现 [J] . 夏卫雷 ,王立松 . 电子科技 . 2013,第002期
6. 一种基于MPI的稀疏化局部尺度并行谱聚类算法的研究与实现 [C] . Li Ruilin ,李瑞琳 ,Zhao Yonghua . 2015全国高性能计算学术年会 . 2015
7. 基于Hadoop MapReduce并行近似谱聚类算法研究与实现 [A] . 杨煜 . 2014

基于 Hadoop MapReduce并行近似谱聚类算法研究与实现

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅