首页> 外文期刊>Applied Soft Computing >Dealing with high-dimensional class-imbalanced datasets: Embedded feature selection for SVM classification
【24h】

Dealing with high-dimensional class-imbalanced datasets: Embedded feature selection for SVM classification

机译:处理高维类别 - 不平衡数据集:SVM分类的嵌入式功能选择

获取原文
获取原文并翻译 | 示例
       

摘要

In this work, we propose a novel feature selection approach designed to deal with two major issues in machine learning, namely class-imbalance and high dimensionality. The proposed embedded strategy penalizes the cardinality of the feature set via the scaling factors technique, and is used with two support vector machine (SVM) formulations designed to deal with the class-imbalanced problem, namely Cost Sensitive SVM, and Support Vector Data Description. The proposed concave formulations are solved via a Quasi-Newton update and Armijo line search. We performed experiments on 12 highly imbalanced microarray datasets using linear and Gaussian kernel, achieving the highest average predictive performance with our approach compared with the most well-known feature selection strategies. (C) 2018 Elsevier B.V. All rights reserved.
机译:在这项工作中,我们提出了一种新颖的特征选择方法,旨在处理机器学习中的两个主要问题,即类别不平衡和高维度。 拟议的嵌入式策略通过缩放因子技术惩罚特征集的基数,并与两个支持向量机(SVM)配方一起使用,该配方旨在处理类别 - 不平衡问题,即成本敏感的SVM,以及支持向量数据描述。 所提出的凹版制剂通过准牛顿更新和ARMIJO线搜索来解决。 我们使用线性和高斯内核对12个高度不平衡的微阵列数据集进行了实验,与我们的方法相比,实现了最高的平均预测性能与最着名的特征选择策略相比。 (c)2018 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号