...
【24h】

A new hybrid feature selection based on multi-filter weights and multi-feature weights

机译:基于多滤波权重和多重特征权重的新混合特征选择

获取原文
获取原文并翻译 | 示例
           

摘要

A traditional feature selection of filters evaluates the importance of a feature by using a particular metric, deducing unstable performances when the dataset changes. In this paper, a new hybrid feature selection (called MFHFS) based on multi-filter weights and multi-feature weights is proposed. Concretely speaking, MFHFS includes the following three stages: Firstly, all samples are normalized and discretized, and the noises and the outliers are removed based on 10-folder cross validation. Secondly, the vector of multi-filter weights and the matrix of multi-feature weights are calculated and used to combine different feature subsets obtained by the optimal filters. Finally, a Q-range based feature relevance calculation method is proposed to measure the relationship of different features and the greedy searching policy is used to filter the redundant features of the temp feature subset to obtain the final feature subset. Experiments are carried out using two typical classifiers of support vector machine and random forest on six datasets (APS, Madelon, CNAE9, Gisette, DrivFace and Amazon). When the measurements of F-1(macro) and F-1(micro) are used, the experimental results show that the proposed method has great improvement on classification accuracy compared to the traditional filters, and it achieves significant improvements on running speed while guaranteeing the classification accuracy compared to typical hybrid feature selections.
机译:传统的滤波器特征选择通过使用特定的度量来评估功能的重要性,当数据集更改时致力于不稳定的性能。在本文中,提出了一种基于多滤波权重和多特征权重的新的混合特征选择(称为MFHF)。具体说话,MFHFS包括以下三个阶段:首先,所有样本都是归一化和离散化的,并且基于10折叠交叉验证去除噪声和异常值。其次,计算多滤波的矢量和多重特征权重的矩阵,并用于组合由最佳滤波器获得的不同特征子集。最后,提出了一种基于Q范围的特征相关性计算方法来测量不同特征的关系,并且使用贪婪搜索策略来过滤临时特征子集的冗余功能以获得最终特征子集。在六个数据集(APS,MadeLon,CNA9,Gisette,Drivface和Amazon上,使用两种典型的支持向量机和随机林进行了实验。当使用F-1(宏)和F-1(微)的测量时,实验结果表明,与传统过滤器相比,该方法对分类准确性的分类精度有很大提高,而且在保证时,它可以实现对运行速度的显着改进与典型的混合特征选择相比,分类准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号