首页> 外文期刊>Information Technology Journal >A Survey of Partition based Clustering Algorithms in Data Mining: An Experimental Approach
【24h】

A Survey of Partition based Clustering Algorithms in Data Mining: An Experimental Approach

机译:数据挖掘中基于分区的聚类算法研究:一种实验方法

获取原文
           

摘要

Clustering is one of the most important research areas in the field of data mining. Clustering means creating groups of objects based on their features in such a way that the objects belonging to the same groups are similar and those belonging in different groups are dissimilar. Clustering is an unsupervised learning technique. Data clustering is the subject of active research in several fields such as statistics, pattern recognition and machine learning. From a practical perspective clustering plays an outstanding role in data mining applications in many domains. The main advantage of clustering is that interesting patterns and structures can be found directly from very large data sets with little or none of the background knowledge. Clustering algorithms can be applied in many areas, for instance marketing, biology, libraries, insurance, city-planning, earthquake studies and www document classification. Data mining adds to clustering the complications of very large datasets with very many attributes of different types. This imposes unique computational requirements on relevant clustering algorithms. A variety of algorithms have recently emerged that meet these requirements and were successfully applied to real-life data mining problems. They are subject of this survey. Also, this survey explores the behavior of some of the partition based clustering algorithms and their basic approaches with experimental results.
机译:集群是数据挖掘领域中最重要的研究领域之一。聚类意味着根据对象的特征创建对象组,这样,属于同一组的对象是相似的,而属于不同组的对象则是不同的。聚类是一种无监督的学习技术。数据聚类是统计,模式识别和机器学习等多个领域中积极研究的主题。从实际的角度来看,集群在许多领域的数据挖掘应用程序中发挥着杰出的作用。聚类的主要优点是,可以从非常少的背景知识或几乎没有背景知识的非常大的数据集中直接找到有趣的模式和结构。聚类算法可应用于许多领域,例如市场营销,生物学,图书馆,保险,城市规划,地震研究和www文档分类。数据挖掘使聚类非常复杂的大型数据集的复杂性更加容易。这对相关的聚类算法提出了独特的计算要求。最近出现了满足这些要求的各种算法,这些算法已成功应用于现实生活中的数据挖掘问题。他们是本次调查的主题。此外,本次调查还通过实验结果探索了一些基于分区的聚类算法的行为及其基本方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号