【24h】

Model-based overlapping clustering

机译:基于模型的重叠聚类

获取原文

摘要

While the vast majority of clustering algorithms are partitional, many real world datasets have inherently overlapping clusters. Several approaches to finding overlapping clusters have come from work on analysis of biological datasets. In this paper, we interpret an overlapping clustering model proposed by Segal et al. [23] as a generalization of Gaussian mixture models, and we extend it to an overlapping clustering model based on mixtures of any regular exponential family distribution and the corresponding Bregman divergence. We provide the necessary algorithm modifications for this extension, and present results on synthetic data as well as subsets of 20-Newsgroups and EachMovie datasets.
机译:尽管绝大多数聚类算法都是分区性的,但许多现实世界的数据集具有内在重叠的聚类。寻找重叠簇的几种方法来自生物学数据集的分析工作。在本文中,我们解释了Segal等人提出的重叠聚类模型。 [23]作为高斯混合模型的推广,我们将其扩展到基于任何规则指数族分布和相应的Bregman发散的混合的重叠聚类模型。我们为此扩展提供了必要的算法修改,并提供了合成数据以及20个新闻组和EachMovie数据集的子集的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号