首页> 外文期刊>Bioinformatics >Evaluation and optimization of clustering in gene expression data analysis
【24h】

Evaluation and optimization of clustering in gene expression data analysis

机译:基因表达数据分析中聚类的评估和优化

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: A measurement of cluster quality is needed to choose potential clusters of genes that contain biologically relevant patterns of gene expression. This is strongly desirable when a large number of gene expression profiles have to be analyzed and proper clusters of genes need to be identified for further analysis, such as the search for meaningful patterns, identification of gene functions or gene response analysis. Results: We propose a new cluster quality method, called stability, by which unsupervised learning of gene expression data can be performed efficiently. The method takes into account a cluster's stability on partition. We evaluate this method and demonstrate its performance using four independent, real gene expression and three simulated datasets. We demonstrate that our method outperforms other techniques listed in the literature. The method has applications in evaluating clustering validity as well as identifying stable clusters.
机译:动机:需要对簇的质量进行测量,以选择可能含有基因表达生物学相关模式的潜在簇。当必须分析大量基因表达谱并且需要鉴定适当的基因簇以进行进一步分析(例如寻找有意义的模式,鉴定基因功能或基因响应分析)时,这是非常需要的。结果:我们提出了一种新的簇质量方法,称为稳定性,通过该方法可以有效地进行基因表达数据的无监督学习。该方法考虑了群集在分区上的稳定性。我们评估此方法,并使用四个独立的真实基因表达和三个模拟数据集证明其性能。我们证明了我们的方法优于文献中列出的其他技术。该方法在评估聚类有效性以及识别稳定聚类中具有应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号