...
首页> 外文期刊>Knowledge-Based Systems >Density peaks clustering with gap-based automatic center detection
【24h】

Density peaks clustering with gap-based automatic center detection

机译:密度峰集聚类与基于差距的自动中心检测

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Clustering is a task used to group data from variegated sources, including Big Data, the Internet of Things, and social media. Density peaks clustering (DPC) has become a popular clustering technique for its simplicity and quality. However, DPC requires a proper subset of input data points to be selected as centers using a plot called "decision graph". This manual specification adds subjectivity and instability, besides breaking the continuous flow of the algorithm. Automatic center detection approaches struggle with obtaining good results while avoiding to add parameters and complexity to the algorithm. We propose an approach to automatically determine cluster centers by detecting gaps between data points in a one-dimensional version of the decision graph; we detect these gaps heuristically by comparing the distance (difference) between pairs of consecutive points in terms of their gamma score. We tested our approach on synthetic and UCI data sets. Results show that the number of clusters is accurately predicted in comparison to other state-of-the-art methods using F-score and Adjusted Rand Index. (C) 2020 Elsevier B.V. All rights reserved.
机译:群集是用于将数据从variegated来源进行分组的任务,包括大数据,事物和社交媒体。密度峰集群(DPC)已成为其简单和质量的流行聚类技术。然而,DPC需要使用称为“决策图”的绘图的中心作为中心选择的适当输入数据点。除了破坏算法的连续流程,本手册规范增加了主观性和不稳定性。自动中心检测方法努力获得良好的效果,同时避免向算法添加参数和复杂性。我们提出了一种方法来通过检测决策图的一维版本中的数据点之间的间隙来自动确定集群中心;通过在伽玛分数方面比较连续点对之间的距离(差异)来检测这些空隙。我们在合成和UCI数据集中测试了我们的方法。结果表明,与使用F分数和调整的RAND指数的其他最先进的方法相比,准确预测了簇的数量。 (c)2020 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号