首页> 外文会议>International Conference on High Performance Computing and Applications >Scalable parallel clustering approach for large data using parallel K means and firefly algorithms

【24h】

Scalable parallel clustering approach for large data using parallel K means and firefly algorithms

机译：使用并行K均值和萤火虫算法的大数据可扩展并行聚类方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper mainly focuses in identifying the limitations of the k means algorithm and to propose the parallelization of the k-means using firefly based clustering method. The new parallel architecture can handle large number of clusters. Firefly algorithm to find initial optimal cluster centroid and then k-means algorithm with optimized centroid to refined them and improve clustering accuracy. The final convergence issue is also addressed and solved to a great extent. Finally modified algorithm is compared with parallel k means is demonstrated with experiments and it has been found that the performance of modified algorithm is better than the existing algorithm. Four typical benchmark data sets from the UCI machine learning repository are used to demonstrate the results of the techniques. To achieve this we can use fork/join method in java programming. It is the most effective design method for achieve good parallel performance.

机译：本文主要着眼于确定k均值算法的局限性，并提出使用基于萤火虫的聚类方法对k均值进行并行化。新的并行体系结构可以处理大量集群。 Firefly算法先找到初始的最佳聚类质心，然后使用具有优化质心的k-means算法精炼它们并提高聚类精度。最后的收敛问题也得到了很大程度的解决。最后通过实验验证了改进算法与并行k均值的比较，发现改进算法的性能优于现有算法。使用UCI机器学习存储库中的四个典型基准数据集来演示该技术的结果。为此，我们可以在Java编程中使用fork / join方法。这是实现良好并行性能的最有效设计方法。

著录项

来源
《International Conference on High Performance Computing and Applications 》|2014年|1-8|共8页
会议地点
作者
Mathew Juby; Vijayakumar R.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Java; learning (artificial intelligence); parallel architectures; parallel programming; pattern clustering; Java programming; UCI machine learning repository; convergence issue; firefly based clustering method; initial optimal cluster centroid; k-means algorithm; k-means parallelization; parallel architecture; parallel k-means; parallel performance; scalable parallel clustering approach; Abstracts; Graphics; Instruction sets; Motion measurement; Optimization; Robots; Clustering; Firefly algorithm; join and fork parallelism; k-means; parallel k-means;

机译：Java;学习（人工智能）;并行体系结构;并行编程;模式聚类; Java编程; UCI机器学习存储库;收敛性;基于萤火虫的聚类方法;初始最优聚类质心; k-means算法; k-means并行化;并行体系结构并行k均值并行性能可扩展的并行聚类方法摘要图形指令集运动测量优化机器人集群Firefly算法联叉并行k均值并行k均值;

相似文献

外文文献
中文文献
专利

1. Scalable parallel clustering using modified Firefly algorithm [J] . Juby Mathew, R. Vijayakumar IOSR journal of computer engineering . 2014 ,第6期

机译：使用改进的Firefly算法的可扩展并行集群
2. 3D Kirchhoff depth migration algorithm: A new scalable approach for parallelization on multicore CPU based cluster [J] . Rastogi Richa, Londhe Ashutosh, Srivastava Abhishek, Computers & geosciences . 2017 ,第MARa期

机译：3D Kirchhoff深度迁移算法：基于多核CPU的集群并行化的新可扩展方法
3. Parallel WaveCluster: A linear scaling parallel clustering algorithm implementation with application to very large datasets [J] . Ahmet Artu Yildinm, Cem Ozdogan Journal of Parallel and Distributed Computing . 2011 ,第7期

机译：并行WaveCluster：一种线性缩放并行聚类算法实现，适用于非常大的数据集
4. A Service-oriented Approach for the Parallelization of Data-intensive Algorithms in a Grid-enabled Cluster [C] . Chun-Wu Chen, Roehm, U. . 2005

机译：面向服务的网格启用集群中数据密集型算法并行化的方法
5. Scalable clustering algorithms and optimization methods for parallel architectures. [D] . Khlopotine, Andrei B. 2015

机译：并行体系结构的可伸缩群集算法和优化方法。
6. Analysis of Parallel Algorithms on SMP Node and Cluster of Workstations Using Parallel Programming Models with New Tile-based Method for Large Biological Datasets [O] . D. D. Shrimankar, S. R. Sathe 2016

机译：大型生物数据集基于新图块的并行编程模型对SMP节点和工作站集群的并行算法进行分析
7. Scalable Parallel Clustering Approach for Large Data using Possibilistic Fuzzy C-Means Algorithm [O] . Juby Mathew, R Vijayakumar 2014

机译：使用可能性模糊C型算法的大数据可扩展并行聚类方法

Scalable parallel clustering approach for large data using parallel K means and firefly algorithms

摘要

著录项

相似文献

相关主题

期刊订阅