首页> 外文期刊>International Journal of Computational Intelligence and Applications >A Comprehensive Comparison on Evolutionary Feature Selection Approaches to Classification
【24h】

A Comprehensive Comparison on Evolutionary Feature Selection Approaches to Classification

机译:分类进化特征选择方法的综合比较

获取原文
获取原文并翻译 | 示例
       

摘要

Feature selection is an important data preprocessing step in machine learning and data mining, such as classification tasks. Research on feature selection has been extensively conducted for more than 50 years and different types of approaches have been proposed, which include wrapper approaches or filter approaches, and single objective approaches or multi-objective approaches. However, the advantages and disadvantages of such approaches have not been thoroughly investigated. This paper provides a comprehensive study on comparing different types of feature selection approaches, specifically including comparisons on the classification performance and computational time of wrappers and filters, generality of wrapper approaches, and comparisons on single objective and multi-objective approaches. Particle swarm optimization (PSO)-based approaches, which include different types of methods, are used as typical examples to conduct this research. A total of 10 different feature selection methods and over 7000 experiments are involved. The results show that filters are usually faster than wrappers, but wrappers using a simple classification algorithm can be faster than filters. Wrappers often achieve better classification performance than filters. Feature subsets obtained from wrappers can be general to other classification algorithms. Meanwhile, multi-objective approaches are generally better choices than single objective algorithms. The findings are not only useful for researchers to develop new approaches to addressing new challenges in feature selection, but also useful for real-world decision makers to choose a specific feature selection method according to their own requirements.
机译:特征选择是机器学习和数据挖掘(例如分类任务)中重要的数据预处理步骤。关于特征选择的研究已经进行了50多年,并且提出了不同类型的方法,包括包装方法或过滤器方法以及单目标方法或多目标方法。但是,这种方法的优缺点尚未得到彻底研究。本文对比较不同类型的特征选择方法进行了全面的研究,特别是包括对包装器和过滤器的分类性能和计算时间进行比较,包装器方法的一般性以及单目标和多目标方法的比较。基于粒子群优化(PSO)的方法(包括不同类型的方法)被用作进行此研究的典型示例。总共涉及10种不同的特征选择方法和7000多个实验。结果表明,过滤器通常比包装器更快,但是使用简单分类算法的包装器可以比过滤器更快。包装器通常比过滤器具有更好的分类性能。从包装器获得的特征子集可以是其他分类算法的通用子集。同时,与单目标算法相比,多目标方法通常是更好的选择。这些发现不仅对研究人员开发出解决特征选择新挑战的新方法很有用,而且对于现实世界的决策者根据自己的需求选择特定的特征选择方法也非常有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号