首页> 外文期刊>Bioinformatics >Multiclass cancer classification and biomarker discovery using GA-based algorithms
【24h】

Multiclass cancer classification and biomarker discovery using GA-based algorithms

机译:使用基于遗传算法的多分类癌症分类和生物标志物发现

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: The development of microarray-based high-throughput gene profiling has led to the hope that this technology could provide an efficient and accurate means of diagnosing and classifying tumors, as well as predicting prognoses and effective treatments. However, the large amount of data generated by microarrays requires effective reduction of discriminant gene features into reliable sets of tumor biomarkers for such multiclass tumor discrimination. The availability of reliable sets of biomarkers, especially serum biomarkers, should have a major impact on our understanding and treatment of cancer.Results: We have combined genetic algorithm (GA) and all paired (AP) support vector machine (SVM) methods for multiclass cancer categorization. Predictive features can be automatically determined through iterative GA/SVM, leading to very compact sets of non-redundant cancer-relevant genes with the best classification performance reported to date. Interestingly, these different classifier sets harbor only modest overlapping gene features but have similar levels of accuracy in leave-one-out cross-validations (LOOCV). Further characterization of these optimal tumor discriminant features, including the use of nearest shrunken centroids (NSC), analysis of annotations and literature text mining, reveals previously unappreciated tumor subclasses and a series of genes that could be used as cancer biomarkers. With this approach, we believe that microarray-based multiclass molecular analysis can be an effective tool for cancer biomarker discovery and subsequent molecular cancer diagnosis.
机译:动机:基于微阵列的高通量基因谱分析的发展已导致人们希望,该技术可以提供一种有效且准确的手段来对肿瘤进行诊断和分类,以及预测预后和有效治疗。然而,由微阵列产生的大量数据需要将可区分的基因特征有效地减少为用于此类多类肿瘤鉴别的可靠的肿瘤生物标志物组。结果:我们结合了遗传算法(GA)和所有成对(AP)支持向量机(SVM)的方法,用于多类癌症的诊断和治疗,这将对我们对癌症的理解和治疗产生重大影响。癌症分类。可以通过迭代GA / SVM自动确定预测特征,从而生成非常紧凑的与癌症相关的非冗余基因集,具有迄今为止报道的最佳分类性能。有趣的是,这些不同的分类器集仅包含适度的重叠基因特征,但在留一法式交叉验证(LOOCV)中具有相似的准确性水平。这些最佳肿瘤区分特征的进一步表征,包括使用最近的收缩质心(NSC),注释分析和文献文字挖掘,揭示了以前未被认识的肿瘤亚类和可用作癌症生物标记物的一系列基因。通过这种方法,我们认为基于微阵列的多类分子分析可以成为癌症生物标志物发现和后续分子癌症诊断的有效工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号