首页> 外文期刊>Procedia Computer Science >A Feature Selection Algorithm Based on Qualitative Mutual Information for Cancer Microarray Data
【24h】

A Feature Selection Algorithm Based on Qualitative Mutual Information for Cancer Microarray Data

机译:基于定性互信息的癌症微阵列数据特征选择算法

获取原文
       

摘要

This paper develops a new algorithm which selects the appropriate feature set and overcomes the challenges with microarray data. Initially, it balances the dataset and then combines the importance score obtained from random forest and mutual information to develop the new technique. Adding importance score of each feature along with mutual information is qualitative mutual information (QMI). In this study, an experiment has been performed which compares the final features reduced from the algorithm with the feature subset found using importance score obtained from random forest and proposed QMI approach. The comparison has been made in terms of number of features selected and classification accuracy from three different classifiers, Na?ve Bayes, C4.5 and IB1. The results depict that the proposed algorithm effectively reduces the features and improves the classification. The experiment also proves that combing importance score from random forest with mutual information is more effective than applying them individually.
机译:本文开发了一种新算法,该算法选择合适的特征集并克服了微阵列数据的挑战。最初,它平衡数据集,然后结合从随机森林和相互信息中获得的重要性得分来开发新技术。与互信息一起添加每个功能的重要性得分是定性互信息(QMI)。在这项研究中,已进行了一项实验,该实验将算法减少的最终特征与使用从随机森林和拟议的QMI方法获得的重要性得分找到的特征子集进行比较。根据选择的特征数量和来自三个不同分类器(朴素贝叶斯,C4.5和IB1)的分类准确性进行了比较。结果表明,该算法有效地减少了特征,提高了分类效率。实验还证明,将随机森林中的重要性得分与相互信息相结合比单独应用它们更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号