Boosting Prediction Accuracy on Imbalanced Datasets with SVM Ensembles

Abstract

Learning from imbalanced datasets is inherently difficult due to lack of information about the minority class. In this paper, we study the performance of SVMs, which have gained great success in many real applications, in the imbalanced data context. Through empirical analysis, we show that SVMs suffer from biased decision boundaries, and that their prediction performance drops dramatically when the data is highly skewed. We propose to combine an integrated sampling technique with an ensemble of SVMs to improve the prediction performance. The integrated sampling technique combines both over-sampling and under-sampling techniques. Through empirical study, we show that our method outperforms individual SVMs as well as several other state-of-the-art classifiers.
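
The combination described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify how over-sampling or under-sampling is performed, so the sketch assumes random duplication of minority examples and a random disjoint split of the majority class. The function names (`make_balanced_subsets`, `train_ensemble`, `predict_majority`), the `oversample_factor` parameter, and the majority-vote combination rule are all hypothetical choices made for illustration. The base learner is passed in as `train_fn`, so an SVM trainer from any library could be plugged in.

```python
import random
from collections import Counter


def make_balanced_subsets(majority, minority, n_subsets, oversample_factor, seed=0):
    """Pair each disjoint majority chunk with an over-sampled minority set.

    Assumption: over-sampling is plain random duplication and
    under-sampling is a random disjoint split; the abstract does not
    state the exact procedures used in the paper.
    """
    rng = random.Random(seed)
    # Over-sampling: keep every minority example and add random duplicates.
    n_extra = len(minority) * (oversample_factor - 1)
    boosted_minority = list(minority) + [rng.choice(minority) for _ in range(n_extra)]
    # Under-sampling: shuffle the majority class and deal it into chunks.
    shuffled = list(majority)
    rng.shuffle(shuffled)
    chunks = [shuffled[i::n_subsets] for i in range(n_subsets)]
    return [(chunk, boosted_minority) for chunk in chunks]


def train_ensemble(subsets, train_fn):
    """Train one classifier per subset.

    train_fn(X, y) is a hypothetical hook that returns a callable
    mapping a single example to a label (0 = majority, 1 = minority).
    """
    models = []
    for negatives, positives in subsets:
        X = list(negatives) + list(positives)
        y = [0] * len(negatives) + [1] * len(positives)
        models.append(train_fn(X, y))
    return models


def predict_majority(models, x):
    """Combine the members' predictions by simple majority vote."""
    votes = Counter(model(x) for model in models)
    return votes.most_common(1)[0][0]
```

With 90 majority and 10 minority examples, `n_subsets=3` and `oversample_factor=3` give three training sets of 30 majority and 30 minority examples each, so every ensemble member sees balanced data while the majority class as a whole is still covered. An odd number of subsets avoids tied votes.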

Bibliographic details

Source
Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining (PAKDD 2006); April 9-12, 2006; Singapore (SG) | 2006 | pp. 107-118 | 12 pages
Conference location: Singapore (SG)
Authors
Yang Liu; Aijun An; Xiangji Huang
Author affiliation

Department of Computer Science and Engineering, York University, Toronto, Ontario, M3J 1P3, Canada;

Original format: PDF
Text language: English (eng)
Chinese Library Classification (CLC): TP311.13

Similar literature

1. Boosted Near-miss Under-sampling on SVM ensembles for concept detection in large-scale imbalanced datasets [J]. Bao Lei, Juan Cao, Li Jintao. Neurocomputing. 2016, issue JANa8
2. Performance enhanced Boosted SVM for Imbalanced datasets [J]. Sundar R., Punniyamoorthy M. Applied Soft Computing. 2019
3. Class-imbalanced dynamic financial distress prediction based on Adaboost-SVM ensemble combined with SMOTE and time weighting [J]. Sun Jie, Li Hui, Fujita Hamido. Information Fusion. 2020
4. Boosting Prediction Accuracy on Imbalanced Datasets with SVM Ensembles [C]. Yang Liu, Aijun An, Xiangji Huang. Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining. 2006
5. Boosted Feature Selection for Class Dedicated SVM and Its Application in Fetal Health Prediction [D]. Lee, Jinpyo. 2019
6. Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble [O]. Dong-Jun Yu, Jun Hu, Hui Yan. 2014
7. Prediction of novel pre-microRNAs with high accuracy through boosting and SVM [O]. Yuanwei Zhang, Yifan Yang, Huan Zhang. 2011
