An Empirical Analysis of Imbalanced Data Classification

Abstract

SVM has been given top consideration for addressing the challenging problem of learning from imbalanced data. Here, we conduct an empirical classification analysis of new UCI datasets that have different imbalance ratios, sizes and complexities. The experimentation consists of comparing the classification results of SVM with those of two other popular classifiers, Naive Bayes and the C4.5 decision tree, to explore their pros and cons. To make the comparative experiments more comprehensive and to get a better idea of the learning performance of each classifier, we employ four performance metrics in total: Sensitivity, Specificity, G-means and time-based efficiency. For each benchmark dataset, we perform an empirical search of the learning model by training the three classifiers numerous times under different parameter settings and performance measurements. This paper reports the most significant results, i.e. the highest performance achieved by each classifier for each dataset. In summary, SVM outperforms the other two classifiers in terms of Sensitivity (or Specificity) for all the datasets, and is more accurate in terms of G-means when classifying large datasets.
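The three accuracy-oriented metrics named in the abstract can be illustrated with a minimal Python sketch. It assumes the standard textbook definitions (Sensitivity as the true positive rate, Specificity as the true negative rate, G-mean as the square root of their product); the abstract does not spell out the paper's exact formulas, and the function names and toy labels below are hypothetical, not taken from the paper.

```python
import math

def confusion_counts(y_true, y_pred, positive=1):
    """Count TP, FN, TN, FP for a binary problem (hypothetical helper)."""
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    return tp, fn, tn, fp

def sensitivity(tp, fn):
    """True positive rate: share of minority (positive) cases recovered."""
    return tp / (tp + fn) if (tp + fn) else 0.0

def specificity(tn, fp):
    """True negative rate: share of majority (negative) cases recovered."""
    return tn / (tn + fp) if (tn + fp) else 0.0

def g_mean(sens, spec):
    """Geometric mean of sensitivity and specificity (assumed definition)."""
    return math.sqrt(sens * spec)

# Toy imbalanced sample: 2 positives, 8 negatives (made-up labels).
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]

tp, fn, tn, fp = confusion_counts(y_true, y_pred)
sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} g_mean={g_mean(sens, spec):.3f}")
```

On this toy sample plain accuracy is 0.8 even though half of the minority class is missed, which is why imbalance studies tend to report Sensitivity, Specificity and G-mean rather than accuracy alone.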

Bibliographic details

Source: Computer and Information Science, 2015, No. 1, 1 page in total
Original format: PDF
CLC classification: Computing technology, computer technology

Similar literature

1. An Empirical Analysis of Imbalanced Data Classification [J]. Shu Zhang, Samira Sadaoui, Malek Mouhoub. Computer and Information Science. 2015, No. 1

2. An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics [J]. López V., Fernández A., García S. Information Sciences: An International Journal. 2013

3. An Empirical Comparison of Classification Algorithms for Imbalanced Credit Scoring Datasets [C]. Leopoldo Soares de Melo Junior, Franco Maria Nardini, Chiara Renso. IEEE International Conference on Machine Learning and Applications. 2019

4. Deep Learning Based Imbalanced Data Classification and Information Retrieval for Multimedia Big Data [D]. Yan, Yilin. 2018

5. Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification [O]. Jinyan Li, Simon Fong, Yunsick Sung. 2016

6. An Empirical Analysis of Imbalanced Data Classification [O]. Shu Zhang, Samira Sadaoui, Malek Mouhoub. 2015
