首页> 外文期刊>Journal of Information and Telecommunication >A new penalty-based wrapper fitness function for feature subset selection with evolutionary algorithms
【24h】

A new penalty-based wrapper fitness function for feature subset selection with evolutionary algorithms

机译:一种新的基于惩罚的包装器适应度函数,用于进化算法的特征子集选择

获取原文
           

摘要

Feature subset selection is an important preprocessing task for any real life data mining or pattern recognition problem. Evolutionary computational (EC) algorithms are popular as a search algorithm for feature subset selection. With the classification accuracy as the fitness function, the EC algorithms end up with feature subsets having considerably high recognition accuracy but the number of residual features also remain quite high. For high dimensional data, reduction of number of features is also very important to minimize computational cost of overall classification process. In this work, a wrapper fitness function composed of classification accuracy with another penalty term which penalizes for large number of features has been proposed. The proposed wrapper fitness function is used for feature subset evaluation and subsequent selection of optimal feature subset with several EC algorithms. The simulation experiments are done with several benchmark data sets having small to large number of features. The simulation results show that the proposed wrapper fitness function is efficient in reducing the number of features in the final selected feature subset without significant reduction of classification accuracy. The proposed fitness function has been shown to perform well for high-dimensional data sets with dimension up to 10,000.
机译:对于任何现实数据挖掘或模式识别问题,特征子集选择都是一项重要的预处理任务。进化计算(EC)算法作为特征子集选择的搜索算法很受欢迎。以分类精度作为适应度函数,EC算法最终得到具有相当高识别精度的特征子集,但残存特征的数量也仍然很高。对于高维数据,减少特征数量对于最小化整体分类过程的计算成本也非常重要。在这项工作中,提出了一种由分类精度和另一个惩罚大量特征的惩罚项组成的包装适应度函数。所提出的包装适应度函数用于特征子集评估以及随后通过几种EC算法选择最佳特征子集。仿真实验是使用具有少量或大量特征的几个基准数据集完成的。仿真结果表明,提出的包装适应度函数可以有效地减少最终选择的特征子集中的特征数量,而不会显着降低分类精度。所提出的适应度函数已显示对维数高达10,000的高维数据集表现良好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号