首页> 外文期刊>Bioinformatics >SamCluster: an integrated scheme for automatic discovery of sample classes using gene expression profile.
【24h】

SamCluster: an integrated scheme for automatic discovery of sample classes using gene expression profile.

机译:SamCluster:使用基因表达谱自动发现样品类别的综合方案。

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Feature (gene) selection can dramatically improve the accuracy of gene expression profile based sample class prediction. Many statistical methods for feature (gene) selection such as stepwise optimization and Monte Carlo simulation have been developed for tissue sample classification. In contrast to class prediction, few statistical and computational methods for feature selection have been applied to clustering algorithms for pattern discovery. Results: An integrated scheme and corresponding program SamCluster for automatic discovery of sample classes based on gene expression profile is presented in this report. The scheme incorporates the feature selection algorithms based on the calculation of CV (coefficient of variation) and t-test into hierarchical clustering and proceeds as follows. At first, the genes with their CV greater than the pre-specified threshold are selected for cluster analysis, which results in two putative sample classes. Then, significantly differentially expressed genes in the two putative sample classes with p-values
机译:动机:特征(基因)选择可以大大提高基于基因表达谱的样品类别预测的准确性。已经开发了许多用于特征(基因)选择的统计方法,例如逐步优化和蒙特卡洛模拟,用于组织样本分类。与类预测相反,很少有用于特征选择的统计和计算方法已应用于用于模式发现的聚类算法。结果:本报告介绍了一种基于基因表达谱自动发现样品类别的集成方案和相应程序SamCluster。该方案将基于CV(变异系数)和t检验的计算的特征选择算法合并到层次聚类中,并进行如下操作。首先,选择CV大于预定阈值的基因进行聚类分析,从而得出两个假定的样品类别。然后,从t检验中选择p值

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号