...
首页> 外文期刊>Applied Soft Computing >A novel hybrid genetic algorithm with granular information for feature selection and optimization
【24h】

A novel hybrid genetic algorithm with granular information for feature selection and optimization

机译:一种新型混合遗传算法,具有特征选择和优化的粒度信息

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Feature selection has been a significant task for data mining and pattern recognition. It aims to choose the optimal feature subset with the minimum redundancy and the maximum discriminating ability. This paper analyzes the feature selection method from two aspects of data and algorithm. In order to deal with the redundant features and irrelevant features in high-dimensional & low-sample data and low-dimensional & high-sample data, the feature selection algorithm model based on the granular information is presented in this paper. Thus, our research examines experimentally how granularity level affects both the classification accuracy and the size of feature subset for feature selection. First of all, the improved binary genetic algorithm with feature granulation (IBGAFG) is used to select the significant features. Then, the improved neighborhood rough set with sample granulation (INRSG) is proposed under different granular radius, which further improves the quality of the feature subset. Finally, in order to find out the optimal granular radius, granularity lambda optimization based on genetic algorithm (ROGA) is presented. The optimal granularity parameters are found adaptively according to the feedback of classification accuracy. The performance of the proposed algorithms is tested upon eleven publicly available data sets and is compared with other supervisory methods or evolutionary algorithms. Additionally, the ROGA algorithm is applied to the enterprise financial dataset, which can select the features that affect the financial status. Experiment results demonstrate that the approaches are efficient and can provide higher classification accuracy using granular information. (c) 2018 Elsevier B.V. All rights reserved.
机译:特征选择是数据挖掘和模式识别的重要任务。它旨在选择具有最小冗余和最大辨别能力的最佳特征子集。本文分析了来自数据和算法的两个方面的特征选择方法。为了处理高维和低样本数据和低维和高样本数据中的冗余功能和无关的功能,本文提出了基于粒度信息的特征选择算法模型。因此,我们的研究通过实验检查了粒度水平如何影响特征选择的分类准确性和特征子集的大小。首先,使用具有特征造粒(IBGAFG)的改进的二进制遗传算法来选择显着的特征。然后,在不同的颗粒半径下提出具有样品造粒(INRSG)的改进的邻域粗糙集,这进一步提高了特征子集的质量。最后,为了找出最佳粒度半径,提出了基于遗传算法(Roga)的粒度Lambda优化。根据分类精度的反馈,自适应地发现最佳粒度参数。提出的算法的性能在11个公开的数据集上测试,并与其他监控方法或进化算法进行比较。此外,Roga算法应用于企业财务数据集,可以选择影响财务状况的功能。实验结果表明,该方法有效,可以使用粒度信息提供更高的分类精度。 (c)2018 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号