【24h】

Data Clustering Approaches Survey and Analysis

机译:数据聚类方法调查和分析

获取原文

摘要

In the current world, there is a need to analyze and extract information from data. Clustering is one such analytical method which involves the distribution of data into groups of identical objects. Every group is known as a cluster, which consists of objects that have affinity within the cluster and disparity with the objects in other groups. This paper is intended to examine and evaluate various data clustering algorithms. The two major categories of clustering approaches are partition and hierarchical clustering. The algorithms which are dealt here are: k-means clustering algorithm, hierarchical clustering algorithm, density based clustering algorithm, self-organizing map algorithm, and expectation maximization clustering algorithm. All the mentioned algorithms are explained and analyzed based on the factors like the size of the dataset, type of the data set, number of clusters created, quality, accuracy and performance. This paper also provides the information about the tools which are used to implement the clustering approaches. The purpose of discussing the various software/tools is to make the beginners and new researchers to understand the working, which will help them to come up with new product and approaches for the improvement.
机译:在当前的世界中,需要从数据分析和提取信息。聚类是一种这样的分析方法,涉及数据分布到相同对象的组中。每个组都被称为群集,它由对象在群集中具有关联的对象和其他组中的对象。本文旨在检查和评估各种数据聚类算法。两种主要类别的聚类方法是分区和分层群集。处理此处的算法是:K-Meansic聚类算法,分层聚类算法,基于密度的聚类算法,自组织地图算法和期望最大化聚类算法。根据数据集的大小,数据集的类型,创建的群集数,质量,准确性和性能等因素,解释和分析所有提到的算法。本文还提供有关用于实现聚类方法的工具的信息。讨论各种软件/工具的目的是使初学者和新的研究人员了解工作,这将有助于他们提出新产品和改进方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号