首页> 外文会议>IEEE Congress on Evolutionary Computation >Binary Differential Evolution based Feature Selection Method with Mutual Information for Imbalanced Classification Problems
【24h】

Binary Differential Evolution based Feature Selection Method with Mutual Information for Imbalanced Classification Problems

机译:基于二进制差分演化的特征选择方法,具有相互信息的互相信息

获取原文

摘要

The feature selection process aims to eliminate redundant attributes in the data set, thus it leads to improved classification accuracy. The solution to a feature selection problem is challenging due to the ever-increasing data volume. This problem gets more complicated in the case of imbalanced data sets. Most of the traditional feature selection methods weigh on the majority class when selecting the informative feature subset, thus the selected features often get a bias towards the majority class and neglect the significance of the minority class in the whole process, which results in poor classification performance in the case of minority class objects. Multiple evolutionary algorithms based feature selection methods have been introduced in the past but most of them ignore the class imbalance problem while selecting the most informative feature subset. In this article, we propose a binary differential evolution algorithm with Manhattan distance-based mutation, which employs a joint mutual information maximization based feature selection criteria along with a novel class distribution-based weight assignment scheme to tackle the class imbalance problem. In the experimental studies, we have tested the performance of the proposed method on well-known data sets using three widely-used performance metrics (Average Classification Accuracy, F-measure, G-Means). According to the empirical results, the proposed method performs better than its contenders in most of the data sets.
机译:特征选择过程旨在消除数据集中的冗余属性,从而导致提高分类精度。特征选择问题的解决方案由于数据量不断增加而挑战。在数据集的情况下,此问题变得更加复杂。大多数传统特征选择方法在选择信息特征子集时重量大多数类,因此所选的特征通常会对多数阶级进行偏见,忽视整个过程中少数阶级的重要性,导致分类差的绩效差在少数类对象的情况下。基于多个进化算法的特征选择方法已经引入过去,但其中大多数忽略了类别不平衡问题,同时选择最具信息丰富的功能子集。在本文中,我们提出了一种基于曼哈顿距离的突变的二进制差分演进算法,其采用基于联合互动的特征选择标准以及基于新的基于类分布的权重分配方案来解决类别不平衡问题。在实验研究中,我们已经使用三种广泛使用的性能度量(平均分类精度,F测量,G均值)测试了在众所周知的数据集上对众所周知的数据集的表现。根据经验结果,所提出的方法在大多数数据集中表现优于其竞争者。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号