首页> 外文会议>International Conference on Enterprise Information Systems >ROBUST, GENERALIZED, QUICK AND EFFICIENT AGGLOMERATIVE CLUSTERING
【24h】

ROBUST, GENERALIZED, QUICK AND EFFICIENT AGGLOMERATIVE CLUSTERING

机译:强大,广义,快速,快速高效的群集聚类

获取原文

摘要

Hierarchical approaches, which are dominated by the generic agglomerative clustering algorithm, are suitable for cases in which the count of distinct clusters in the data is not known a priori; this is not a rare case in real data. On the other hand, important problems are related to their application, such as susceptibility to errors in the initial steps that propagate all the way to the final output and high complexity. Finally, similarly to all other clustering techniques, their efficiency decreases as the dimensionality of their input increases. In this paper we propose a robust, generalized, quick and efficient extension to the generic agglomerative clustering process. Robust refers to the proposed approach's ability to overcome the classic algorithm's susceptibility to errors in the initial steps, generalized to its ability to simultaneously consider multiple distance metrics, quick to its suitability for application to larger datasets via the application of the computationally expensive components to only a subset of the available data samples and efficient to its ability to produce results that are comparable to those of trained classifiers, largely outperforming the generic agglomerative process.
机译:由通用凝聚聚类算法主导的分层方法适用于数据中不同群集的计数不知道先验的情况;这不是实际数据中的罕见情况。另一方面,重要的问题与其应用有关,例如对初始步骤中的错误的易感性,这些步骤一直传播到最终输出和高复杂性。最后,与所有其他聚类技术类似,随着输入的维度增加,它们的效率降低。在本文中,我们提出了对通用附聚类聚类过程的稳健,广义快速,有效的延伸。强大的是提出的方法克服经典算法对初始步骤中的错误的易感性的能力,推广到其同时考虑多个距离度量的能力,通过应用计算昂贵的组件仅应用于更大的数据集的适用性。可用数据样本的一个子集和有效地产生与训练分类器相当的结果的能力,在很大程度上优于通用附聚过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号