A-BIRCH: Automatic Threshold Estimation for the BIRCH Clustering Algorithm

机译：A-BIRCH：BIRCH聚类算法的自动阈值估计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Clustering algorithms are recently regaining attention with the availability of large datasets and the rise of parallelized computing architectures. However, most clustering algorithms do not scale well with increasing dataset sizes and require proper parametrization for correct results. In this paper we present A-BIRCH, an approach for automatic threshold estimation for the BIRCH clustering algorithm using Gap Statistic. This approach renders the global clustering step of BIRCH unnecessary and does not require knowledge on the expected number of clusters beforehand. This is achieved by analyzing a small representative subset of the data to extract attributes such as the cluster radius and the minimal cluster distance. These attributes are then used to compute a threshold that results, with high probability, in the correct clustering of elements. For the analysis of the representative subset we parallelized Gap Statistic to improve performance and ensure scalability.

机译：群集算法最近通过大型数据集的可用性和并行化计算架构的兴起来重新启发注意力。但是，大多数聚类算法不会越来越好，随着数据集大小，并且需要适当的参数化以进行正确的结果。在本文中，我们呈现A-BIRCH，使用间隙统计来实现桦木聚类算法的自动阈值估计方法。这种方法使桦木的全球聚类步骤呈现不必要的，并且不需要预先对预期的群集数量的知识。这是通过分析数据的小代表性子集来实现，以提取诸如簇半径和最小簇距离的属性。然后使用这些属性来计算在对元素的正确聚类中具有高概率的阈值。为了分析代表子集，我们并行化缺口统计数据以提高性能并确保可扩展性。

著录项

来源
《International Neural Network Society Conference on Big Data》|2017年|xvii 348 p.|共10页
会议地点
作者
Boris Lorbeer; Ana Kosareva; Bersant Deva; D?enan Softi?; Peter Ruppel; Axel Küpper;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13-532;
关键词
Automatic; Threshold; Estimation;

机译：自动;阈值;估计;
入库时间 2022-08-21 12:15:59

相似文献

外文文献
中文文献
专利

1. Combustion Quality Estimation in Power Station Boilers using Median Threshold Clustering Algorithms [J] . K.Sujatha, Dr.N.Pappa, A. Kalaivani International Journal of Engineering Science and Technology . 2010,第7期

机译：基于中值阈值聚类算法的电站锅炉燃烧质量估算
2. AUTOMATIC MULTILEVEL THRESHOLDING BASED ON TWO-STAGE OTSU'S METHOD WITH CLUSTER DETERMINATION BY VALLEY ESTIMATION [J] . Deng-Yuan Huang, Ta-Wei Lin, Wu-Chih Hu International Journal of Innovative Computing Information and Control . 2011,第10期

机译：基于两阶段OTSU法的谷歌估计聚类自动多阈值法
3. Automatic detection of solitary lung nodules using quality threshold clustering, genetic algorithm and diversity index [J] . Antonio Oseas de Carvalho Filho, Wener Borges de Sampaio, Aristofanes Correa Silva, Artificial intelligence in medicine . 2014,第3期

机译：使用质量阈值聚类，遗传算法和多样性指数自动检测孤立性肺结节
4. A-BIRCH: Automatic Threshold Estimation for the BIRCH Clustering Algorithm [C] . Boris Lorbeer, Ana Kosareva, Bersant Deva, International Neural Network Society Conference on Big Data . 2017

机译：A-BIRCH：桦木聚类算法的自动阈值估计
5. Imputation of automatic control algorithms and estimation in high-dimensional linear regression [D] . Ye, Fei. 2010

机译：高维线性回归中自动控制算法的推算和估计
6. Cross-Clustering: A Partial Clustering Algorithm with Automatic Estimation of the Number of Clusters [O] . Paola Tellaroli, Marco Bazzi, Michele Donato, -1

机译：跨集群：具有自动估计集群数量的部分集群算法
7. AutoClustering An Estimation of Distribution Algorithm for the Automatic Generation of Clustering Algorithms [O] . A S. G. Meiguins, Roberto C. Limão, Belém Brasil, 2013

机译：自动聚类自动生成聚类算法的分布算法估计

A-BIRCH: Automatic Threshold Estimation for the BIRCH Clustering Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅