首页> 外文会议>International Conference on Inventive Computing and Informatics >A Novel Filter approach for efficient selection and Small round blue-cell tumor cancer detection using microarray gene expression data
【24h】

A Novel Filter approach for efficient selection and Small round blue-cell tumor cancer detection using microarray gene expression data

机译:一种新的过滤方法,用于使用微阵列基因表达数据的高效选择和小圆形蓝细胞肿瘤癌检测

获取原文

摘要

Feature selection is a vital task in machine learning and data mining to reduce the dimensionality of the data and also improves the classification performance of an algorithm in terms of high precision, low computational cost, and low vulnerability. Various many technologies have been successfully applied in the previous experimental studies for tumor detection. The foremost challenging task of gene selection method is extracting informative genes contribution in the classification from the DNA microarray datasets with lesser computational load. In this paper, we propose the conglomeration of the Kendall Correlation (KC) and Filter based Feature Selection (FS) method for better classification and prediction. We demonstrate the extensive comparison of the effect of Kendall Correlation with FS methods, using Relief-F, Joint Mutual Information (JMI), and Mutual Information based feature selection (MIFS), Conditional mutual information maximization (CMIM), and Max-relevance & Min-Redundancy (mRMR). To measure the classification performance of four diverse supervised classifiers K-Nearest Neighbor (KNN), Support Vector Machines (SVM), Naive Bayes (NB), and Decision Tree (DT) have been used on small round blue cell tumors (SRBCT) dataset. The result demonstrates that Kendall Correlation in accumulation with mRMR performances better than other combinations.
机译:特征选择是机器学习和数据挖掘中的重要任务,以减少数据的维度,并在高精度,低计算成本和低漏洞方面提高算法的分类性能。在先前的肿瘤检测实验研究中成功地应用了各种技术。基因选择方法的最重要挑战性任务是提取具有较小计算负荷的DNA微阵列数据集的分类中的信息基因贡献。在本文中,我们提出了KENDALL相关(KC)和基于滤波器的特征选择(FS)方法的集合,以获得更好的分类和预测。我们展示了与FS方法的kendall相关性的大量比较,使用reasif-f,联合互信息(JMI)和相互信息的特征选择(MIFS),条件相互信息最大化(CMIM)和最大相关性&最小冗余(MRMR)。为了测量四个不同的监督分类器K-最近邻(knn),支持向量机(SVM),Naive Bayes(Nb)和决策树(DT)的分类性能已被用于小圆形蓝细胞肿瘤(SRBCT)数据集。结果表明,在MRMR性能比其他组合的情况下积累的KENDALL相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号