...
首页> 外文期刊>BMC Bioinformatics >Shrinkage Clustering: a fast and size-constrained clustering algorithm for biomedical applications
【24h】

Shrinkage Clustering: a fast and size-constrained clustering algorithm for biomedical applications

机译:收缩群集:生物医学应用程序的快速和大小约束聚类算法

获取原文

摘要

Many common clustering algorithms require a two-step process that limits their efficiency. The algorithms need to be performed repetitively and need to be implemented together with a model selection criterion. These two steps are needed in order to determine both the number of clusters present in the data and the corresponding cluster memberships. As biomedical datasets increase in size and prevalence, there is a growing need for new methods that are more convenient to implement and are more computationally efficient. In addition, it is often essential to obtain clusters of sufficient sample size to make the clustering result meaningful and interpretable for subsequent analysis. We introduce Shrinkage Clustering, a novel clustering algorithm based on matrix factorization that simultaneously finds the optimal number of clusters while partitioning the data. We report its performances across multiple simulated and actual datasets, and demonstrate its strength in accuracy and speed applied to subtyping cancer and brain tissues. In addition, the algorithm offers a straightforward solution to clustering with cluster size constraints. Given its ease of implementation, computing efficiency and extensible structure, Shrinkage Clustering can be applied broadly to solve biomedical clustering tasks especially when dealing with large datasets.
机译:许多常见的聚类算法需要一个限制其效率的两步过程。需要重复地执行算法,并且需要与模型选择标准一起实现。需要这两个步骤,以便确定数据中存在的群集数和相应的集群成员资格。随着生物医学数据集的尺寸和流行增加,对更方便实施的新方法越来越需要,并且更加计算效率。此外,获得足够的样本大小的集群通常是必不可少的,以使聚类结果有意义和解释随后的分析。我们引入收缩群集,一种基于矩阵分解的新型聚类算法,同时在分区数据时同时找到最佳群集数。我们在多种模拟和实际数据集中报告其性能,并展示其在亚型癌症和脑组织上的准确性和速度的强度。此外,该算法还提供了与群集大小约束聚类的直接解决方案。鉴于实现的实施方便,计算效率和可扩展结构,可以广泛应用收缩群集,以解决生物医学聚类任务,特别是在处理大型数据集时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号