...
【24h】

FEM-DBSCAN: An Efficient Density-Based Clustering Approach

机译:FEM-DBSCAN:基于有效的基于密度的聚类方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Due to the uncontrollable growth of data generation in various networks, rapid clustering of massive datasets is seriously demanded in order to reveal the hidden structure of data as well as discovering the relations among samples. Among the clustering approaches, density-based clustering methods showed an acceptable processing speed to encounter with big data. However, they have some fixed parameters, which are not certainly optimized for all parts of the feature space. Moreover, the complexity of these clustering methods is highly dependent on the number of samples. In this paper, we have deployed Fisher expectation maximization (FEM) to adaptively divide the feature space into some subspaces, where no cluster is shared between the adjacent subspaces. Afterward, we applied density-based spatial clustering of applications with noise (DBSCAN) to each partition yielding to decrease the computational complexity on each thread as well as better learning of its parameters on each subspace. The performance of the proposed method was assessed over three big-size and ten middle-size datasets. The achieved results implied the superiority of the proposed method to OPTICS, Den Clue and DBSCAN methods in terms of clustering accuracy (purity) and processing time.
机译:由于各种网络中的数据生成的无法控制的增长,因此要求大量数据集的快速聚类,以揭示数据的隐藏结构以及发现样品之间的关系。在聚类方法中,基于密度的聚类方法显示了具有大数据的可接受的处理速度。但是,它们具有一些固定参数,这对于特征空间的所有部件并不肯定优化。此外,这些聚类方法的复杂性高度依赖于样品的数量。在本文中,我们已经部署了Fisher期望最大化(FEM),以便将要素空间自适应地将特征空间分成一些子空间,其中没有群集在相邻子空间之间共享。之后,我们将基于密度的空间聚类应用于具有噪声(DBSCAN)的应用程序的空间聚类,从而促使每个分区降低每个线程上的计算复杂度,以及更好地学习其在每个子空间上的参数。提出的方法的性能在三个大尺寸和十个中小型数据集中进行了评估。实现的结果暗示了在聚类精度(纯度)和处理时间方面的光学方法,Den Clue和DBSCAN方法的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号