Data Clustering Approaches Survey and Analysis

机译：数据聚类方法调查和分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the current world, there is a need to analyze and extract information from data. Clustering is one such analytical method which involves the distribution of data into groups of identical objects. Every group is known as a cluster, which consists of objects that have affinity within the cluster and disparity with the objects in other groups. This paper is intended to examine and evaluate various data clustering algorithms. The two major categories of clustering approaches are partition and hierarchical clustering. The algorithms which are dealt here are: k-means clustering algorithm, hierarchical clustering algorithm, density based clustering algorithm, self-organizing map algorithm, and expectation maximization clustering algorithm. All the mentioned algorithms are explained and analyzed based on the factors like the size of the dataset, type of the data set, number of clusters created, quality, accuracy and performance. This paper also provides the information about the tools which are used to implement the clustering approaches. The purpose of discussing the various software/tools is to make the beginners and new researchers to understand the working, which will help them to come up with new product and approaches for the improvement.

机译：在当前的世界中，需要从数据分析和提取信息。聚类是一种这样的分析方法，涉及数据分布到相同对象的组中。每个组都被称为群集，它由对象在群集中具有关联的对象和其他组中的对象。本文旨在检查和评估各种数据聚类算法。两种主要类别的聚类方法是分区和分层群集。处理此处的算法是：K-Meansic聚类算法，分层聚类算法，基于密度的聚类算法，自组织地图算法和期望最大化聚类算法。根据数据集的大小，数据集的类型，创建的群集数，质量，准确性和性能等因素，解释和分析所有提到的算法。本文还提供有关用于实现聚类方法的工具的信息。讨论各种软件/工具的目的是使初学者和新的研究人员了解工作，这将有助于他们提出新产品和改进方法。

著录项

来源
《International Conference on Futuristic Trends on Computational Analysis and Knowledge Management 》|2015年||共6页
会议地点
作者
Ahalya G.; Hari Mohan Pandey;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 O241-532;
关键词
Clustering; Expectation maximization clustering algorithm; Hierarchical clustering; K-means clustering algorithm; Self-organization maps algorithm; Density based clustering algorithm;

机译：聚类;期望最大化聚类算法;分层聚类;k均值聚类算法;自组织地图算法;基于密度的聚类算法;

相似文献

外文文献
中文文献
专利

1. Visual Approaches for Exploratory Data Analysis: A Survey of the Visual Assessment of Clustering Tendency (VAT) Family of Algorithms [J] . Kumar Dheeraj, Bezdek James C. IEEE Systems, Man, and Cybernetics Magazine . 2020 ,第2期

机译：探索性数据分析的视觉方法：对算法聚类趋势的视觉评估调查（算法
2. An Empirical Comparison of Several Clustered Data Approaches Under Confounding Due to Cluster Effects in the Analysis of Complications of Coronary Angioplasty [J] . Jesse A. Berlin, Stephen E. Kimmel, Thomas R. Ten Have, Biometrics: Journal of the Biometric Society : An International Society Devoted to the Mathematical and Statistical Aspects of Biology . 1999 ,第2期

机译：冠状血管成形术并发症分析中因聚类效应而混杂的几种聚类数据方法的经验比较
3. A Preliminary Survey on Optimized Multiobjective Metaheuristic Methods for Data Clustering Using Evolutionary Approaches [J] . Ramachandra Rao Kurada, K Karteeka Pavan, AV Dattareya Rao International Journal of Computer Science & Information Technology (IJCSIT) . 2013 ,第5期

机译：使用进化方法的数据聚类优化多目标元启发式方法初探
4. The survey on approaches to efficient clustering and classification analysis of big data [C] . Bhagyashri S. Gandhi, Leena A. Deshpande International Conference on Computing, Communication, Control and Automation . 2016

机译：大数据有效聚类和分类分析方法研究
5. Novel Approaches to Creating Synthetic Data from Multivariate Survey Data for Statistical Disclosure Control [D] . Chen, Allshine. 2020

机译：从多变量调查数据创建合成数据的新方法进行统计泄露控制
6. Care-seeking and appropriate treatment for childhood acute respiratory illness: an analysis of Demographic and Health Survey and Multiple Indicators Cluster Survey datasets for high-mortality countries [O] . Emily M Mosites, Alastair I Matheson, Eli Kern, 2014

机译：儿童急性呼吸道疾病的寻求护理和适当治疗：高死亡率国家的人口与健康调查和多指标类集调查数据集的分析
7. A Preliminary Survey on Optimized Multiobjective Metaheuristic Methods for Data Clustering Using Evolutionary Approaches [O] . Ramachandra Rao Kurada, K Karteeka Pavan, AV Dattareya Rao 2013

机译：使用进化方法的数据聚类优化多目标元启发式方法初探
8. Cluster Analysis-Based Approaches for Geospatiotemporal Data Mining of Massive Data Sets for Identification of Forest Threats. [R] . Mills, R. T., Hoffman, F. M., Kumar, J., 2011

机译：基于聚类分析的海量数据集地理时空数据挖掘方法用于森林威胁识别。

Data Clustering Approaches Survey and Analysis

摘要

著录项

相似文献

相关主题

期刊订阅