Improvement and Parallelism of k-Means Clustering Algorithm

TIAN Jinlan; ZHU Lin; ZHANG Suqin; LIU Lu

首页> 中文期刊> 《清华大学学报（英文版）》 >Improvement and Parallelism of k-Means Clustering Algorithm

Improvement and Parallelism of k-Means Clustering Algorithm

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

The k-means clustering algorithm is one of the most commonly used algorithms for clustering analysis. The traditional k-means algorithm is, however, inefficient while working on large numbers of data sets and improving the algorithm efficiency remains a problem. This paper focuses on the efficiency issues of cluster algorithms. A refined initial cluster centers method is designed to reduce the number of iterative procedures in the algorithm. A parallel k-means algorithm is also studied for the problem of the operation limitation of a single processor machine when given huge data sets. The analytical results demonstrate that these improvements can greatly enhance the efficiency of the k-means algorithm, i.e., allow the grouping of a large number of data sets more accurately and more quickly. The analysis has theoretical and practical importance for work on the improvement and parallelism of cluster algorithms.

著录项

来源
《清华大学学报（英文版）》 |2005年第3期|277-281|共5页
作者
TIAN Jinlan; ZHU Lin; ZHANG Suqin; LIU Lu;
展开▼
作者单位

Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;

展开▼
原文格式 PDF
正文语种 chi
中图分类数学;
关键词
data mining; cluster analysis; k-means algorithm; parallelism;

机译：数据挖掘聚类分析k均值算法并行度;

Improvement and Parallelism of k-Means Clustering Algorithm

摘要

著录项

相关主题

期刊订阅