首页> 外文期刊>Journal of machine learning research >Kernel-estimated Nonparametric Overlap-Based Syncytial Clustering
【24h】

Kernel-estimated Nonparametric Overlap-Based Syncytial Clustering

机译:内核估计基于非参数重叠的Syncytial聚类

获取原文
           

摘要

Commonly-used clustering algorithms usually find ellipsoidal, spherical or other regular-structured clusters, but are more challenged when the underlying groups lack formal structure or definition. Syncytial clustering is the name that we introduce for methods that merge groups obtained from standard clustering algorithms in order to reveal complex group structure in the data. Here, we develop a distribution-free fully-automated syncytial clustering algorithm that can be used with $k$-means and other algorithms. Our approach estimates the cumulative distribution function of the normed residuals from an appropriately fit $k$-groups model and calculates the estimated nonparametric overlap between each pair of clusters. Groups with high pairwise overlap are merged as long as the estimated generalized overlap decreases. Our methodology is always a top performer in identifying groups with regular and irregular structures in several datasets and can be applied to datasets with scatter or incomplete records. The approach is also used to identify the distinct kinds of gamma ray bursts in the Burst and Transient Source Experiment 4Br catalog and the distinct kinds of activation in a functional Magnetic Resonance Imaging study.
机译:常用的聚类算法通常会发现椭圆形,球形或其他常规结构的集群,但在底层群体缺乏正式结构或定义更挑战。合胞聚类是我们介绍的方法,为了从标准聚类算法获得的合并组中的数据揭示了复杂的组结构的名称。在这里,我们开发出可与$ķ$ -means等算法可以使用免费的分布完全自动化的合胞聚类算法。我们的方法从适当配合$ $ķ模型 - 基团估计赋范残差的累积分布函数,并计算每对簇之间的估计的非参数重叠。高成对组重叠合并只要估计广义重叠减少。我们的方法总是在识别几个数据集有定期和不定期的结构基团的表现最好,可用于与分散或不完整记录的数据集。该方法也被用于识别在该脉冲串和暂态源实验4BR目录和不同种活化的以功能性磁共振成像研究的不同种伽马射线突发。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号