首页> 外文期刊>International Journal of Engineering Science and Technology >A NOVEL HYBRID METHOD FOR GENE SELECTION IN MICROARRAY BASED CANCER CLASSIFICATION
【24h】

A NOVEL HYBRID METHOD FOR GENE SELECTION IN MICROARRAY BASED CANCER CLASSIFICATION

机译:基于微阵列的癌症分类中一种新的基因选择杂交方法

获取原文
           

摘要

Microarray technology allows researchers to measure the expression levels of many thousands of genes in a single experiment. One of the main applications of the microarray technology is cancer classification in which the problem associated with the analysis is to deal with high number of gene expression data that makes the processing more complex and time consuming. This paper aims to develop a classification algorithm by employing a hybrid method for gene selection. The hybrid method uses Analysis of variance (ANOVA) statistical technique and Principal Component Analysis (PCA) method that overcomes the problem by reducing the gene expression data in to minimal number of gene subsets (Differentially Expressed genes). Then k-nearest neighbors (KNN) classifier is applied on those gene subsets for cancer classification. Three publically available datasets namely leukemia, non-small lung and ovarian cancer data sets are used for classification. Experimental results shows that, the cosine and pearsons correlation distance function with ties rule provide higher classification accuracy and less misclassification error for those differentially expressed genes using different validation methods in KNN classifier.
机译:微阵列技术使研究人员可以在单个实验中测量成千上万个基因的表达水平。微阵列技术的主要应用之一是癌症分类,其中与分析相关的问题是处理大量基因表达数据,这使得处理更加复杂且耗时。本文旨在通过采用混合方法进行基因选择来开发分类算法。混合方法使用方差分析(ANOVA)统计技术和主成分分析(PCA)方法,该方法通过将基因表达数据减少到最小数量的基因子集(差异表达基因)来克服该问题。然后,将k最近邻居(KNN)分类器应用于这些基因子集以进行癌症分类。使用三个可公开获得的数据集,即白血病,非小肺癌和卵巢癌数据集进行分类。实验结果表明,在KNN分类器中使用不同的验证方法,带有差异规则的余弦和梨子相关距离函数可以为那些差异表达基因提供更高的分类准确度,并减少错误分类错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号