首页> 外文会议>2015 IEEE Conference on Collaboration and Internet Computing >Deep Learning with MCA-based Instance Selection and Bootstrapping for Imbalanced Data Classification
【24h】

Deep Learning with MCA-based Instance Selection and Bootstrapping for Imbalanced Data Classification

机译:深度学习与基于MCA的实例选择和自举相关的不平衡数据分类

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose an extended deep learning approach that incorporates instance selection and bootstrapping techniques for imbalanced data classification. In supervised learning, classification performance often deteriorates when the training set is imbalanced where at least one of the classes has a substantially fewer number of instances than the others. We propose to use adaptive synthetic sampling approach (ADASYN) to generate synthetic instances for the minority class. A data pruning process based on multiple correspondence analysis (MCA) is then performed to identify a sub-set of synthetic instances that are most suitable to supplement the existing minority instances. This results in a relatively more balanced training dataset which is then bootstrapped and fed into the convolutional neural networks (CNNs) for classification. Furthermore, we propose to use low-level features pre-processed by principal component analysis (PCA), instead of the commonly used raw signal data, as the input to CNNs to reduce the computational time. The experimental results show the effectiveness of our framework in classifying 54 TRECVID concepts with different imbalanced levels by comparing with other state-of-the-art methods.
机译:在本文中,我们提出了一种扩展的深度学习方法,该方法结合了实例选择和自举技术来实现不平衡的数据分类。在监督学习中,当训练集不平衡时分类性能通常会下降,其中至少一个类别的实例数量明显少于其他类别。我们建议使用自适应综合采样方法(ADASYN)为少数群体类别生成综合实例。然后执行基于多重对应分析(MCA)的数据修剪过程,以识别最适合于补充现有少数实例的合成实例的子集。这导致了一个相对更平衡的训练数据集,然后将其自举并输入到卷积神经网络(CNN)中进行分类。此外,我们建议使用通过主成分分析(PCA)预处理的低级特征,而不是常用的原始信号数据,作为CNN的输入,以减少计算时间。实验结果表明,通过与其他最新方法进行比较,我们的框架在将54种TRECVID概念分类为不同不平衡水平方面是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号