首页> 外文会议>International Conference on Aritificial Neural Networks: Biological Inspirations(ICANN 2005) pt.1; 20050911-15; Warsaw(PL) >Gene Extraction for Cancer Diagnosis by Support Vector Machines An Improvement and Comparison with Nearest Shrunken Centroid Method
【24h】

Gene Extraction for Cancer Diagnosis by Support Vector Machines An Improvement and Comparison with Nearest Shrunken Centroid Method

机译:支持向量机在癌症诊断中的基因提取与最近收缩质心法的比较与比较

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

A cancer diagnosis by using the DNA microarray data faces many challenges the most serious one being the presence of thousands of genes and only several dozens (at the best) of patient's samples. Thus, making any kind of classification in high-dimensional spaces from a limited number of data is both an extremely difficult and a prone to an error procedure. The improved Recursive Feature Elimination with Support Vector Machines (RFE-SVMs) is introduced and used here for an elimination of less relevant genes and just for a reduction of the overall number of genes used in a medical diagnostic. The paper shows why and how the, usually neglected, penalty parameter C influence classification results and the gene selection of RFE-SVMs. With an appropriate parameter C chosen, the reduction in a diagnosis error is as high as 37% on the colon cancer data set. The results suggest that with a properly chosen parameter C, the extracted genes and the constructed classifier will ensure less over-fitting of the training data leading to an increase accuracy in selecting relevant genes.
机译:使用DNA微阵列数据进行癌症诊断面临许多挑战,最严重的挑战是存在数千种基因,而患者样本只有几十个(最好)。因此,根据有限数量的数据在高维空间中进行任何类型的分类既极度困难,又容易出错。本文介绍了使用支持向量机(RFE-SVM)改进的递归特征消除功能,并将其用于消除不相关的基因,以及减少医学诊断中使用的基因总数。本文说明了为什么以及通常被忽略的惩罚参数C影响分类结果以及RFE-SVM的基因选择的原因和方式。选择适当的参数C,在结肠癌数据集上的诊断错误减少率高达37%。结果表明,在正确选择参数C的情况下,提取的基因和构建的分类器将确保较少的训练数据过拟合,从而提高选择相关基因的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号