...
首页> 外文期刊>Expert Systems with Application >Improving classification accuracy using data augmentation on small data sets
【24h】

Improving classification accuracy using data augmentation on small data sets

机译:使用小数据集上的数据增强提高分类准确性

获取原文
获取原文并翻译 | 示例
           

摘要

Data augmentation (DA) is a key element in the success of Deep Learning (DL) models, as its use can lead to better prediction accuracy values when large size data sets are used. DA was not very much used with earlier neural network models before 2012, and the reason might be related to the type of models and the size of the data sets used. We investigate in this work, applying several state-of-the-art models based on Variational Autoencoders (VAEs) and Generative Adversarial Networks (GANs), the effect of DA when using small size data sets, analyzing the results in terms of the prediction accuracy obtained according to the different characteristics of the training samples (number of instances and features, and class unbalance degree). We further introduce modifications to the standard methods used to generate the synthetic samples to alter the class balance representation, and the overall results indicate that with some computational effort a significant increase in prediction accuracy can be obtained when small data sets are considered. (C) 2020 Elsevier Ltd. All rights reserved.
机译:数据增强(DA)是深度学习(DL)模型成功的关键元素,因为当使用大尺寸数据集时,它的使用可能导致更好的预测精度值。 DA在2012年之前与早期的神经网络模型没有很多使用,并且原因可能与模型类型和所用数据集的大小相关。我们在这项工作中调查,应用基于变分的自动化器(VAES)和生成的对冲网络(GANS)的多种最先进的模型,在使用小尺寸数据集时DA的效果,从预测方面分析结果根据训练样本的不同特征获得的准确性(实例和特征的数量和类别不平衡程度)。我们进一步引入了用于生成合成样本的标准方法的修改以改变类平衡表示,并且整体结果表明,在考虑小数据集时,可以获得一些计算工作,可以获得预测精度的显着增加。 (c)2020 elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号