...
首页> 外文期刊>Neurocomputing >MFC-GAN: Class-imbalanced dataset classification using Multiple Fake Class Generative Adversarial Network
【24h】

MFC-GAN: Class-imbalanced dataset classification using Multiple Fake Class Generative Adversarial Network

机译:MFC-GAN:使用多个伪类生成对抗网络的类不平衡数据集分类

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Class-imbalanced datasets are common across different domains such as health, banking, security and others. With such datasets, the learning algorithms are often biased toward the majority class-instances. Data augmentation is a common approach that aims at rebalancing a dataset by injecting more data samples of the minority class instances. In this paper, a new data augmentation approach is proposed using a Generative Adversarial Networks (GAN) to handle the class imbalance problem. Unlike common GAN models, which use a single fake class, the proposed method uses multiple fake classes to ensure a fine-grained generation and classification of the minority class instances. Moreover, the proposed GAN model is conditioned to generate minority class instances aiming at rebalancing the dataset. Extensive experiments were carried out using public datasets, where synthetic samples generated using our model were added to the imbalanced dataset, followed by performing classification using Convolutional Neural Network. Experiment results show that our model can generate diverse minority class instances, even in extreme cases where the number of minority class instances is relatively low. Additionally, superior performance of our model over other common augmentation and oversampling methods was achieved in terms of classification accuracy and quality of the generated samples. (C) 2019 Elsevier B.V. All rights reserved.
机译:类不平衡的数据集在不同领域(例如健康,银行业务,安全性和其他领域)通用。对于这样的数据集,学习算法通常偏向多数类实例。数据扩充是一种常见的方法,旨在通过注入少数类实例的更多数据样本来重新平衡数据集。在本文中,提出了一种使用生成对抗网络(GAN)来处理类不平衡问题的新数据增强方法。与使用单个伪类的普通GAN模型不同,该方法使用多个伪类来确保少数类实例的细粒度生成和分类。此外,所提出的GAN模型的条件是生成旨在重新平衡数据集的少数类实例。使用公共数据集进行了广泛的实验,其中将使用我们的模型生成的合成样本添加到不平衡数据集中,然后使用卷积神经网络进行分类。实验结果表明,即使在少数类实例数量相对较少的极端情况下,我们的模型也可以生成各种少数类实例。此外,就分类准确性和生成样本的质量而言,我们的模型优于其他常见的增强和过采样方法。 (C)2019 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2019年第7期|212-221|共10页
  • 作者

    Ali-Gombe Adamu; Elyan Eyad;

  • 作者单位

    Robert Gordon Univ Sch Comp Sci & Digital Media Aberdeen Scotland;

    Robert Gordon Univ Sch Comp Sci & Digital Media Aberdeen Scotland|Robert Gordon Univ Higher Educ Acad Aberdeen Scotland;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Image classification; Imbalanced data; Deep learning;

    机译:图像分类;数据不平衡;深度学习;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号