首页> 外文会议>IEA 2011;International conference on information engineering and applications >A Spectral Clustering-Based Dataset Structure Analysis and OutlierDetection Progress
【24h】

A Spectral Clustering-Based Dataset Structure Analysis and OutlierDetection Progress

机译:基于谱聚类的数据集结构分析和离群检测进展

获取原文

摘要

To solve the problem that in real applications with spectral clustering algorithm, the number of clusters of a dataset is not given as a prior, a dataset structure analysis and outlier detection process is proposed in the paper. The proposed process is on the basis of spectral clustering algorithm and is consisted of four steps. The first step apply some algorithm, such as DBSCAN, which does not need the number of clusters as input to cluster the dataset to get an approximation of the number of clusters. The second step uses the approximation obtained from the first step to get the upper bound of the number of the clusters. The third step uses the upper bound obtained from the second step to get the optimal value of the number of the clusters, and output the optimal cluster result. The last step applies the LOF algorithm with the result from the third step to find the data objects with the largest probability to be outliers.
机译:针对光谱聚类算法在实际应用中没有事先给出数据集聚类数量的问题,提出了一种数据集结构分析和离群值检测方法。所提出的过程基于频谱聚类算法,并且由四个步骤组成。第一步应用某种算法,例如DBSCAN,它不需要将簇数作为输入来对数据集进行聚类以获得近似的簇数。第二步使用从第一步获得的近似值来获得簇数的上限。第三步骤使用从第二步骤获得的上限来获得聚类数的最佳值,并输出最佳聚类结果。最后一步将LOF算法与第三步的结果一起应用,以找到概率最大的数据对象是异常值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号