首页> 外文OA文献 >An Interval-based Multiobjective Approach to Feature Subset Selection Using Joint Modeling of Objectives and Variables
【2h】

An Interval-based Multiobjective Approach to Feature Subset Selection Using Joint Modeling of Objectives and Variables

机译:基于区间和目标变量联合建模的多目标特征子集选择方法

摘要

This paper studies feature subset selection in classification using a multiobjective estimation of distribution algorithm. We consider six functions, namely area under ROC curve, sensitivity, specificity, precision, F1 measure and Brier score, for evaluation of feature subsets and as the objectives of the problem. One of the characteristics of these objective functions is the existence of noise in their values that should be appropriately handled during optimization. Our proposedudalgorithm consists of two major techniques which are specially designed for the feature subset selection problem. The first one is a solution ranking method based on interval values to handle the noise in the objectives of this problem. The second one is a model estimation method for learning a joint probabilistic model of objectives and variables which is used to generate new solutions and advance through the search space. To simplify model estimation, l1 regularized regression is used to select audsubset of problem variables before model learning. The proposed algorithm is compared with a well-known ranking method for interval-valued objectives and a standard multiobjective genetic algorithm. Particularly, the effects of the two new techniques are experimentally investigated. The experimental results show that the proposed algorithm is able to obtain comparable or betterudperformance on the tested datasets.
机译:本文利用分布算法的多目标估计研究分类中的特征子集选择。我们考虑六个功能,即ROC曲线下的面积,灵敏度,特异性,精度,F1度量和Brier得分,以评估特征子集并作为问题的目标。这些目标函数的特征之一是在优化过程中应适当处理的噪声值存在。我们提出的 udalgorithm由两种主要技术组成,这些技术是专门为特征子集选择问题设计的。第一种是基于间隔值的解决方案排名方法,以解决此问题中的噪声。第二种是用于学习目标和变量的联合概率模型的模型估计方法,该模型用于生成新解并在搜索空间中前进。为了简化模型估计,在模型学习之前,使用11正则化回归来选择问题变量的子集。将该算法与著名的区间值目标排序方法和标准多目标遗传算法进行了比较。特别是,对两种新技术的效果进行了实验研究。实验结果表明,该算法能够在测试数据集上获得可比的或更好的 ud表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号