【24h】

Deep Over-sampling Framework for Classifying Imbalanced Data

机译:深度过度采样框架,用于分类不平衡数据

获取原文

摘要

Class imbalance is a challenging issue in practical classification problems for deep learning models as well as traditional models. Traditionally successful countermeasures such as synthetic over-sampling have had limited success with complex, structured data handled by deep learning models. In this paper, we propose Deep Over-sampling (DOS), a framework for extending the synthetic over-sampling method to the deep feature space acquired by a convolutional neural network (CNN). Its key feature is an explicit, supervised representation learning, for which the training data presents each raw input sample with a synthetic embedding target in the deep feature space, which is sampled from the linear subspace of in-class neighbors. We implement an iterative process of training the CNN and updating the targets, which induces smaller in-class variance among the embeddings, to increase the discriminative power of the deep representation. We present an empirical study using public bench-marks, which shows that the DOS framework not only counteracts class imbalance better than the existing method, but also improves the performance of the CNN in the standard, balanced settings.
机译:类别失衡是深度学习模型的实际分类问题以及传统模式的具有挑战性问题。传统上成功的对策,例如合成过采样的成功与深层学习模型处理的复杂,结构化数据有限。在本文中,我们提出了深度过度采样(DOS),该框架将合成过采样方法扩展到由卷积神经网络(CNN)获取的深度特征空间。其关键特征是一种明确的监督表示学习,培训数据将每个原始输入样本呈现具有在深度特征空间中的合成嵌入目标的原始输入样本,该目标是从课堂内邻居的线性子空间中采样。我们实施培训CNN的迭代过程并更新目标,这在嵌入中诱导较小的课堂方差,以增加深度表示的辨别力。我们使用公共板块标记提出了一个实证研究,这表明DOS框架不仅比现有方法更好地抵消类别不平衡,而且还可以提高CNN在标准平衡设置中的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号