...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >A reduced universum twin support vector machine for class imbalance learning
【24h】

A reduced universum twin support vector machine for class imbalance learning

机译:用于类别不平衡学习的减少的Universum双胞胎支持向量机

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In most of the real world datasets, there is an imbalance in the number of samples belonging to different classes. Various pattern classification problems such as fault or disease detection involve class imbalanced data. The support vector machine (SVM) classifier becomes biased towards the majority class due to class imbalance. Moreover, in the existing SVM based techniques for class imbalance, there is no information about the distribution of data. Motivated by the idea of prior information about data distribution, a reduced universum twin support vector machine for class imbalance learning (RUTSVM-CIL) is proposed in this paper. For the first time, universum learning is incorporated with SVM to solve the problem of class imbalance. Oversampling and undersampling of data is performed to remove the imbalance in the classes. The universum data points are used to give prior information about the data. To reduce the computation time of our universum based algorithm, we use a small sized rectangular kernel matrix. The reduced kernel matrix needs less storage space, and thus applicable for large scale imbalanced datasets. Comprehensive experimentation is performed on various synthetic, real world and large scale imbalanced datasets. In comparison to the existing approaches for class imbalance, the proposed RUTSVM-CIL gives better generalization performance for most of the benchmark datasets. Also, the computation cost of RUTSVM-CIL is very less, making it suitable for real world applications. (C) 2020 Elsevier Ltd. All rights reserved.
机译:在大多数真实的世界数据集中,属于不同类别的样本数量不平衡。诸如故障或疾病检测等各种模式分类问题涉及类别的不平衡数据。由于类别不平衡,支持向量机(SVM)分类器朝向多数类偏置。此外,在基于SVM的基于SVM技术的类别不平衡的技术中,没有关于数据分发的信息。通过关于数据分布的先前信息的主旨,提出了一种用于类别不平衡学习(RUTSVM-CIL)的减少的Universum双支持向量机。首次,Universum学习被SVM融入,解决类别不平衡的问题。对数据的过采样和缺点进行了应用,以消除类中的不平衡。 Universum数据点用于提供有关数据的先前信息。为了减少基于Universum的算法的计算时间,我们使用小型矩形矩形矩阵。减少的内核矩阵需要较少的存储空间,因此适用于大规模的不平衡数据集。对各种合成,现实世界和大规模不平衡数据集进行综合实验。与现有的类别不平衡方法相比,所提出的RUTSVM-CIL为大多数基准数据集提供了更好的泛化性能。此外,RUTSVM-CIL的计算成本非常少,使其适用于现实世界应用。 (c)2020 elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号