【24h】

Boosting Prediction Accuracy on Imbalanced Datasets with SVM Ensembles

机译:使用SVM集成提高不平衡数据集的预测准确性

获取原文
获取原文并翻译 | 示例

摘要

Learning from imbalanced datasets is inherently difficult due to lack of information about the minority class. In this paper, we study the performance of SVMs, which have gained great success in many real applications, in the imbalanced data context. Through empirical analysis, we show that SVMs suffer from biased decision boundaries, and that their prediction performance drops dramatically when the data is highly skewed. We propose to combine an integrated sampling technique with an ensemble of SVMs to improve the prediction performance. The integrated sampling technique combines both over-sampling and under-sampling techniques. Through empirical study, we show that our method outperforms individual SVMs as well as several other state-of-the-art classifiers.
机译:由于缺乏有关少数群体的信息,因此从不平衡的数据集中学习非常困难。在本文中,我们研究了SVM的性能,在不平衡的数据环境中,SVM在许多实际应用中均获得了巨大的成功。通过经验分析,我们显示SVM受偏向决策边界的影响,当数据高度偏斜时,其预测性能会急剧下降。我们建议将集成采样技术与SVM集成相结合,以提高预测性能。集成采样技术结合了过采样和欠采样技术。通过经验研究,我们证明了我们的方法优于单个SVM以及其他几个最新的分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号