...
首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >A genetic algorithm-based method for feature subset selection
【24h】

A genetic algorithm-based method for feature subset selection

机译:基于遗传算法的特征子集选择方法

获取原文
获取原文并翻译 | 示例
           

摘要

As a commonly used technique in data preprocessing, feature selection selects a subset of informative attributes or variables to build models describing data. By removing redundant and irrelevant or noise features, feature selection can improve the predictive accuracy and the comprehensibility of the predictors or classifiers. Many feature selection algorithms with different selection criteria has been introduced by researchers. However, it is discovered that no single criterion is best for all applications. In this paper, we propose a framework based on a genetic algorithm (GA) for feature subset selection that combines various existing feature selection methods. The advantages of this approach include the ability to accommodate multiple feature selection criteria and find small subsets of features that perform well for a particular inductive learning algorithm of interest to build the classifier. We conducted experiments using three data sets and three existing feature selection methods. The experimental results demonstrate that our approach is a robust and effective approach to find subsets of features with higher classification accuracy and/or smaller size compared to each individual feature selection algorithm.
机译:作为数据预处理中的一种常用技术,特征选择选择信息性属性或变量的子集来构建描述数据的模型。通过删除冗余和无关或噪声特征,特征选择可以提高预测准确性和预测变量或分类变量的可理解性。研究人员介绍了许多具有不同选择标准的特征选择算法。但是,发现没有一个单独的标准对所有应用程序都是最佳的。在本文中,我们提出了一种基于遗传算法(GA)的特征子集选择框架,该框架结合了各种现有特征选择方法。这种方法的优点包括能够容纳多个特征选择标准,并找到对于感兴趣的特定归纳学习算法表现良好的小特征子集,以构建分类器。我们使用三个数据集和三个现有特征选择方法进行了实验。实验结果表明,与每个单独的特征选择算法相比,我们的方法是一种功能强大且有效的方法,可以找到具有更高分类精度和/或更小尺寸的特征子集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号