首页> 外文期刊>Bioinformatics >Graph-based consensus clustering for class discovery from gene expression data
【24h】

Graph-based consensus clustering for class discovery from gene expression data

机译:基于图的共识聚类,用于从基因表达数据中发现类别

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Consensus clustering, also known as cluster ensemble, is one of the important techniques for microarray data analysis, and is particularly useful for class discovery from microarray data. Compared with traditional clustering algorithms, consensus clustering approaches have the ability to integrate multiple partitions from different cluster solutions to improve the robustness, stability, scalability and parallelization of the clustering algorithms. By consensus clustering, one can discover the underlying classes of the samples in gene expression data. Results: In addition to exploring a graph-based consensus clustering (GCC) algorithm to estimate the underlying classes of the samples in microarray data, we also design a new validation index to determine the number of classes in microarray data. To our knowledge, this is the first time in which GCC is applied to class discovery for microarray data. Given a pre specified maximum number of classes (denoted as K_(max) in this article), our algorithm can discover the true number of classes for the samples in microarray data according to a new cluster validation index called the Modified Rand Index. Experiments on gene expression data indicate that our new algorithm can (ⅰ) outperform most of the existing algorithms, (ⅱ) identify the number of classes correctly in real cancer datasets, and (ⅲ) discover the classes of samples with biological meaning.
机译:动机:共识聚类,也称为聚类集成,是微阵列数据分析的重要技术之一,对于从微阵列数据中发现类别特别有用。与传统的聚类算法相比,共识聚类方法能够集成来自不同聚类解决方案的多个分区,以提高聚类算法的鲁棒性,稳定性,可扩展性和并行性。通过共识聚类,可以发现基因表达数据中样本的基础类别。结果:除了探索基于图的共识聚类(GCC)算法来估计微阵列数据中样本的基础类别外,我们还设计了一种新的验证指标来确定微阵列数据中的类别数量。据我们所知,这是第一次将GCC应用于微阵列数据的分类发现。给定一个预先指定的最大类别数(在本文中称为K_(max)),我们的算法可以根据称为改良兰德指数的新聚类验证索引,发现微阵列数据中样本的真实类别数。基因表达数据的实验表明,我们的新算法可以(ⅰ)胜过大多数现有算法,(ⅱ)正确识别真实癌症数据集中的类别数量,并且(ⅲ)发现具有生物学意义的样本类别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号