首页> 外文会议>International Conference on Computer Science and Education >A Many-objective Feature Selection Algorithm for Multi-label Classification Based on Computational Complexity of Features
【24h】

A Many-objective Feature Selection Algorithm for Multi-label Classification Based on Computational Complexity of Features

机译:基于特征计算复杂度的多目标分类多目标特征选择算法

获取原文

摘要

Multi-label classification constructs a model on instances which are associated to a set of labels. Similar to traditional single-label classification, redundant and irrelevant features degrade the performance of classification in terms of multiple criteria. So, feature selection task can be modeled as optimizing several conflicting objectives in a large search space simultaneously. Minimizing the number of features and the error of classification are two well-known objectives which are considered in several multi-objective feature selection methods. In addition of these objectives, the computational complexity of features is one of the most important properties of selected features which should be minimized as a crucial objective. As a result, it is desired to select less complex features while offer a higher classification accuracy. The key contribution of this paper is proposing a many-objective optimization method, for first time, to select best subset of features for multi-label data based on four objectives including number of features, two classification error measures (i.e., Hamming loss and Ranking loss), and the complexity of selected features. The defined many-objective optimization problem is solved using a proposed binary version of NSGA-III algorithm. In order to evaluate the proposed algorithm (i.e., binary NSGA-III), a benchmarking is conducted on eight multi-label datasets in terms of several multi-objective assessment. Experimental results show significant improvements for proposed method in comparison with NSGA-II approach.
机译:多标签分类在与一组标签相关联的实例上构建模型。与传统的单标签分类类似,冗余和不相关的功能会根据多个标准降低分类性能。因此,可以将特征选择任务建模为在大型搜索空间中同时优化多个冲突目标。最小化特征数量和分类错误是两个众所周知的目标,这在几种多目标特征选择方法中已得到考虑。除了这些目标之外,要素的计算复杂度也是所选要素的最重要属性之一,应将其最小化作为关键目标。结果,期望选择较少复杂的特征同时提供较高的分类精度。本文的主要贡献是首次提出了一种多目标优化方法,该方法基于四个目标(包括特征数量,两个分类误差度量(即汉明损失和排名))为多标签数据选择最佳特征子集损失),以及所选功能的复杂性。使用提出的NSGA-III二进制版本的算法可以解决已定义的多目标优化问题。为了评估所提出的算法(即二进制NSGA-III),根据几个多目标评估对八个多标签数据集进行了基准测试。实验结果表明,与NSGA-II方法相比,该方法具有明显的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号