【24h】

Deep Over-sampling Framework for Classifying Imbalanced Data

机译:深度过采样框架,用于对不平衡数据进行分类

获取原文

摘要

Class imbalance is a challenging issue in practical classification problems for deep learning models as well as traditional models. Traditionally successful countermeasures such as synthetic over-sampling have had limited success with complex, structured data handled by deep learning models. In this paper, we propose Deep Over-sampling (DOS), a framework for extending the synthetic over-sampling method to the deep feature space acquired by a convolutional neural network (CNN). Its key feature is an explicit, supervised representation learning, for which the training data presents each raw input sample with a synthetic embedding target in the deep feature space, which is sampled from the linear sub-space of in-class neighbors. We implement an iterative process of training the CNN and updating the targets, which induces smaller in-class variance among the embeddings, to increase the discriminative power of the deep representation. We present an empirical study using public benchmarks, which shows that the DOS framework not only counteracts class imbalance better than the existing method, but also improves the performance of the CNN in the standard, balanced settings.
机译:在深度学习模型和传统模型的实际分类问题中,类不平衡是一个具有挑战性的问题。传统上成功的对策(例如合成过采样)在深度学习模型处理的复杂,结构化数据方面取得的成功有限。在本文中,我们提出了深度过采样(DOS),一种框架,用于将综合过采样方法扩展到卷积神经网络(CNN)所获取的深层特征空间。它的关键特征是显式,有监督的表示学习,为此训练数据将每个原始输入样本呈现给具有深层特征空间的合成嵌入目标,该目标是从类内邻居的线性子空间中采样的。我们实施了训练CNN和更新目标的迭代过程,从而在嵌入之间产生较小的类内差异,从而提高了深度表示的判别力。我们使用公共基准进行了一项实证研究,该研究表明DOS框架不仅比现有方法更好地解决了类不平衡问题,而且还改善了标准,平衡设置下CNN的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号