首页> 美国卫生研究院文献>other >Informative Gene Selection and Direct Classification of Tumor Based on Chi-Square Test of Pairwise Gene Interactions
【2h】

Informative Gene Selection and Direct Classification of Tumor Based on Chi-Square Test of Pairwise Gene Interactions

机译:基于成对基因相互作用的卡方检验的肿瘤信息性基因选择和直接分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

In efforts to discover disease mechanisms and improve clinical diagnosis of tumors, it is useful to mine profiles for informative genes with definite biological meanings and to build robust classifiers with high precision. In this study, we developed a new method for tumor-gene selection, the Chi-square test-based integrated rank gene and direct classifier (χ 2-IRG-DC). First, we obtained the weighted integrated rank of gene importance from chi-square tests of single and pairwise gene interactions. Then, we sequentially introduced the ranked genes and removed redundant genes by using leave-one-out cross-validation of the chi-square test-based Direct Classifier (χ 2-DC) within the training set to obtain informative genes. Finally, we determined the accuracy of independent test data by utilizing the genes obtained above with χ 2-DC. Furthermore, we analyzed the robustness of χ 2-IRG-DC by comparing the generalization performance of different models, the efficiency of different feature-selection methods, and the accuracy of different classifiers. An independent test of ten multiclass tumor gene-expression datasets showed that χ 2-IRG-DC could efficiently control overfitting and had higher generalization performance. The informative genes selected by χ 2-IRG-DC could dramatically improve the independent test precision of other classifiers; meanwhile, the informative genes selected by other feature selection methods also had good performance in χ 2-DC.
机译:为了发现疾病的机制并改善肿瘤的临床诊断,挖掘具有明确生物学意义的信息基因的概况以及建立具有高度精确性的强大分类器非常有用。在这项研究中,我们开发了一种新的肿瘤基因选择方法,基于卡方检验的整合秩基因和直接分类器(χ 2 -IRG-DC)。首先,我们从单基因和成对基因相互作用的卡方检验中获得了基因重要性的加权综合等级。然后,我们在训练集中使用基于卡方检验的直接分类器(χ 2 -DC)的留一法交叉验证,依次引入了排名基因并去除了冗余基因。获得信息丰富的基因。最后,我们利用上面获得的带有χ 2 -DC的基因确定了独立测试数据的准确性。此外,我们通过比较不同模型的泛化性能,不同特征选择方法的效率以及不同分类器的准确性,分析了χ 2 -IRG-DC的鲁棒性。对十个多类肿瘤基因表达数据集的独立测试表明,χ 2 -IRG-DC可以有效地控制过度拟合并具有较高的泛化性能。 χ 2 -IRG-DC选择的信息基因可以显着提高其他分类器的独立测试精度。同时,其他特征选择方法选择的信息基因在χ 2 -DC中也表现良好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号