A Spectral Clustering-Based Dataset Structure Analysis and OutlierDetection Progress

机译：基于谱聚类的数据集结构分析和离群检测进展

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

To solve the problem that in real applications with spectral clustering algorithm, the number of clusters of a dataset is not given as a prior, a dataset structure analysis and outlier detection process is proposed in the paper. The proposed process is on the basis of spectral clustering algorithm and is consisted of four steps. The first step apply some algorithm, such as DBSCAN, which does not need the number of clusters as input to cluster the dataset to get an approximation of the number of clusters. The second step uses the approximation obtained from the first step to get the upper bound of the number of the clusters. The third step uses the upper bound obtained from the second step to get the optimal value of the number of the clusters, and output the optimal cluster result. The last step applies the LOF algorithm with the result from the third step to find the data objects with the largest probability to be outliers.

机译：针对光谱聚类算法在实际应用中没有事先给出数据集聚类数量的问题，提出了一种数据集结构分析和离群值检测方法。所提出的过程基于频谱聚类算法，并且由四个步骤组成。第一步应用某种算法，例如DBSCAN，它不需要将簇数作为输入来对数据集进行聚类以获得近似的簇数。第二步使用从第一步获得的近似值来获得簇数的上限。第三步骤使用从第二步骤获得的上限来获得聚类数的最佳值，并输出最佳聚类结果。最后一步将LOF算法与第三步的结果一起应用，以找到概率最大的数据对象是异常值。

著录项

来源
《IEA 2011;International conference on information engineering and applications》|2011年|p.76-85|共10页
会议地点
作者
Lin Hai; Zhu Qingsheng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. A spectral clustering-based framework for detecting community structures in complex networks [J] . Jiang JQ, Dress AWM, Yang GK Applied mathematics letters . 2009,第9期

机译：基于频谱聚类的框架，用于检测复杂网络中的社区结构
2. RiMARS: An automated river morphodynamics analysis method based on remote sensing multispectral datasets [J] . Abolfazl Jalali Shahrood, Meseret Walle Menberu, Hamid Darabi, The Science of the Total Environment . 2020,第Juna1期

机译：RiMARS：一种基于遥感多光谱数据集的自动化河流形态动力学分析方法
3. Multitemporal Land Use and Land Cover Classification from Time-Series Landsat Datasets Using Harmonic Analysis with a Minimum Spectral Distance Algorithm [J] . Progress in Artificial Intelligence . 2020,第2期

机译：利用谐波距离算法使用谐波分析的时间序列Landsat数据集的多型土地利用和陆地覆盖分类
4. TRACKING THE PROGRESS OF A COASTAL SALTMARSH MITIGATION PROJECT USING A 10-YEAR DATASETOF HIGH RESOLUTION DIGITAL MULTI-SPECTRAL IMAGERY [C] . B. B. Nyden, D.A. Stow Fifth international airborne remote sensing conference and exhibition . 2001

机译：使用10年数据集的高分辨率数字多光谱成像技术追踪沿海盐沼缓解项目的进展
5. Supernova Classification and Supernova Astrophysics: Spectral Analysis of the Largest Datasets of Stripped-Envelope Supernovae in the World [D] . Liu, Yuqian. 2017

机译：超新星分类和超新星天体物理学：世界上最大的剥离信封超新星数据集的光谱分析
6. A Guide to Enterotypes across the Human Body: Meta-Analysis of Microbial Community Structures in Human Microbiome Datasets [O] . Omry Koren, Dan Knights, Antonio Gonzalez, 2013

机译：整个人体肠型的指南：人类微生物组数据集中微生物群落结构的荟萃分析
7. A spectral clustering-based framework for detecting community structures in complex networks [O] . Jiang Jeffrey Q., Dress Andreas W.M., Yang Genke 2009

机译：基于频谱聚类的框架，用于检测复杂网络中的社区结构

A Spectral Clustering-Based Dataset Structure Analysis and OutlierDetection Progress

摘要

著录项

相似文献

相关主题

期刊订阅