首页> 外文期刊>Complex & Intelligent Systems >EFS-MI: an ensemble feature selection method for classification
【24h】

EFS-MI: an ensemble feature selection method for classification

机译:EFS-MI:分类的整体特征选择方法

获取原文
获取外文期刊封面目录资料

摘要

Feature selection methods have been used in various applications of machine learning, bioinformatics, pattern recognition and network traffic analysis. In high dimensional datasets, due to redundant features and curse of dimensionality, a learning method takes significant amount of time and performance of the model decreases. To overcome these problems, we use feature selection technique to select a subset of relevant and non-redundant features. But, most feature selection methods are unstable in nature, i.e., for different training datasets, a feature selection method selects different subsets of features that yields different classification accuracy. In this paper, we provide an ensemble feature selection method using feature–class and feature-feature mutual information to select an optimal subset of features by combining multiple subsets of features. The method is validated using four classifiers viz., decision trees, random forests, KNN and SVM on fourteen UCI, five gene expression and two network datasets.
机译:特征选择方法已用于机器学习,生物信息学,模式识别和网络流量分析的各种应用中。在高维数据集中,由于冗余特征和维数的诅咒,一种学习方法会花费大量时间,并且模型的性能会下降。为了克服这些问题,我们使用特征选择技术来选择相关和非冗余特征的子集。但是,大多数特征选择方法本质上是不稳定的,即,对于不同的训练数据集,特征选择方法选择特征的不同子集,从而产生不同的分类精度。在本文中,我们提供了一种使用特征类和特征特征互信息的整体特征选择方法,通过组合多个特征子集来选择最佳特征子集。使用四个分类器(即决策树,随机森林,14个UCI上的KNN和SVM,五个基因表达和两个网络数据集)验证了该方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号