首页> 外文期刊>Cancer Informatics >Prediction of Breast Cancer Metastasis by Gene Expression Profiles: A Comparison of Metagenes and Single Genes
【24h】

Prediction of Breast Cancer Metastasis by Gene Expression Profiles: A Comparison of Metagenes and Single Genes

机译:基因表达谱对乳腺癌转移的预测:亚基因和单基因的比较

获取原文
           

摘要

Background: The popularity of a large number of microarray applications has in cancer research led to the development of predictive or prognostic gene expression profiles. However, the diversity of microarray platforms has made the full validation of such profiles and their related gene lists across studies difficult and, at the level of classification accuracies, rarely validated in multiple independent datasets. Frequently, while the individual genes between such lists may not match, genes with same function are included across such gene lists. Development of such lists does not take into account the fact that genes can be grouped together as metagenes (MGs) based on common characteristics such as pathways, regulation, or genomic location. Such MGs might be used as features in building a predictive model applicable for classifying independent data. It is, therefore, demanding to systematically compare independent validation of gene lists or classifiers based on metagene or individual gene (SG) features.Methods: In this study we compared the performance of either metagene- or single gene-based feature sets and classifiers using random forest and two support vector machines for classifier building. The performance within the same dataset, feature set validation performance, and validation performance of entire classifiers in strictly independent datasets were assessed by 10 times repeated 10-fold cross validation, leave-one-out cross validation, and one-fold validation, respectively. To test the significance of the performance difference between MG- and SG-features/classifiers, we used a repeated down-sampled binomial test approach.Results: MG- and SG-feature sets are transferable and perform well for training and testing prediction of metastasis outcome in strictly independent data sets, both between different and within similar microarray platforms, while classifiers had a poorer performance when validated in strictly independent datasets. The study showed that MG- and SG-feature sets perform equally well in classifying independent data. Furthermore, SG-classifiers significantly outperformed MG-classifier when validation is conducted between datasets using similar platforms, while no significant performance difference was found when validation was performed between different platforms.Conclusion: Prediction of metastasis outcome in lymph node–negative patients by MG- and SG-classifiers showed that SG-classifiers performed significantly better than MG-classifiers when validated in independent data based on the same microarray platform as used for developing the classifier. However, the MG- and SG-classifiers had similar performance when conducting classifier validation in independent data based on a different microarray platform. The latter was also true when only validating sets of MG- and SG-features in independent datasets, both between and within similar and different platforms.
机译:背景:在癌症研究中,大量微阵列应用的普及导致了预测性或预后性基因表达谱的发展。然而,微阵列平台的多样性使得难以在整个研究中对此类图谱及其相关基因列表进行全面验证,并且在分类准确性的水平上,很少在多个独立的数据集中进行验证。通常,尽管此类列表之间的单个基因可能不匹配,但具有相同功能的基因却包含在此类基因列表中。此类清单的开发未考虑以下事实,即可以基于共同的特征(例如途径,调控或基因组位置)将基因分组为元基因(MG)。此类MG可用作构建适用于对独立数据进行分类的预测模型的功能。因此,它需要系统地比较基于元基因或单个基因(SG)特征的基因列表或分类器的独立验证。方法:在这项研究中,我们比较了基于元基因或单个基因的特征集和分类器的性能,使用随机森林和两个支持向量机用于分类器构建。分别通过重复10倍交叉验证,留一法交叉验证和一倍验证分别进行10次评估,来评估同一数据集内的性能,功能集验证性能以及整个分类器在严格独立的数据集中的验证性能。为了测试MG特征和SG特征/分类器之间性能差异的重要性,我们使用了重复的降采样二项式检验方法。结果:MG特征和SG特征集是可转移的,并且在转移的训练和测试预测中表现良好在不同微阵列平台之间以及在类似微阵列平台内的严格独立数据集中,结果得到了证实,而在严格独立的数据集中进行验证时,分类器的性能较差。研究表明,MG和SG功能集在对独立数据进行分类方面表现出色。此外,当使用相似平台在数据集之间进行验证时,SG分类器明显优于MG分类器,而在不同平台之间进行验证时,则没有发现明显的性能差异。结论:MG-预测淋巴结阴性患者的转移结局SG分类器和SG分类器显示,当基于用于开发分类器的相同微阵列平台在独立数据中进行验证时,SG分类器的性能明显优于MG分类器。但是,在基于不同微阵列平台的独立数据中进行分类器验证时,MG和SG分类器具有相似的性能。当仅验证独立数据集中的相似和不同平台之间以及之内的MG和SG功能集时,后者也适用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号