首页> 外文会议>AAAI Conference on Artificial Intelligence >Cost-Sensitive Feature Selection via F-Measure Optimization Reduction
【24h】

Cost-Sensitive Feature Selection via F-Measure Optimization Reduction

机译:经由F测量优化减少的成本敏感特征选择

获取原文
获取外文期刊封面目录资料

摘要

Feature selection aims to select a small subset from the high-dimensional features which can lead to better learning performance, lower computational complexity, and better model readability. The class imbalance problem has been neglected by traditional feature selection methods, therefore the selected features will be biased towards the majority classes. Because of the superiority of F-measure to accuracy for imbalanced data, we propose to use F-measure as the performance measure for feature selection algorithms. As a pseudo-linear function, the optimization of F-measure can be achieved by minimizing the total costs. In this paper, we present a novel cost-sensitive feature selection (CSFS) method which optimizes F-measure instead of accuracy to take class imbalance issue into account. The features will be selected according to optimal F-measure classifier after solving a series of cost-sensitive feature selection sub-problems. The features selected by our method will fully represent the characteristics of not only majority classes, but also minority classes. Extensive experimental results conducted on synthetic, multi-class and multi-label datasets validate the efficiency and significance of our feature selection method.
机译:特征选择旨在从高维功能中选择一个小型子集,这可以导致更好的学习性能,更低的计算复杂性和更好的模型可读性。传统的特征选择方法忽略了类别不平衡问题,因此所选功能将偏向于多数类。由于F测量的优越性,以准确性为不平衡数据,我们建议使用F测量作为特征选择算法的性能测量。作为伪线性函数,可以通过最小化总成本来实现F测量的优化。在本文中,我们提出了一种新的成本敏感特征选择(CSFS)方法,该方法优化F测量,而不是准确性,以考虑课程不平衡问题。在解决一系列成本敏感的特征选择子问题之后,将根据最佳F测量分类器选择该功能。我们的方法选择的功能将完全代表不仅是多数类的特征,也是少数阶级。对合成,多级和多标签数据集进行的广泛实验结果验证了我们特征选择方法的效率和意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号