...
首页> 外文期刊>IEEE signal processing letters >Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification
【24h】

Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification

机译:深度卷积神经网络和环境增强分类的数据增强

获取原文
获取原文并翻译 | 示例
           

摘要

The ability of deep convolutional neural networks (CNNs) to learn discriminative spectro-temporal patterns makes them well suited to environmental sound classification. However, the relative scarcity of labeled data has impeded the exploitation of this family of high-capacity models. This study has two primary contributions: first, we propose a deep CNN architecture for environmental sound classification. Second, we propose the use of audio data augmentation for overcoming the problem of data scarcity and explore the influence of different augmentations on the performance of the proposed CNN architecture. Combined with data augmentation, the proposed model produces state-of-the-art results for environmental sound classification. We show that the improved performance stems from the combination of a deep, high-capacity model and an augmented training set: this combination outperforms both the proposed CNN without augmentation and a “shallow” dictionary learning model with augmentation. Finally, we examine the influence of each augmentation on the model's classification accuracy for each class, and observe that the accuracy for each class is influenced differently by each augmentation, suggesting that the performance of the model could be improved further by applying class-conditional data augmentation.
机译:深度卷积神经网络(CNN)学习判别性光谱时态模式的能力使其非常适合于环境声分类。但是,标记数据的相对稀缺性阻碍了这一大容量模型系列的开发。这项研究有两个主要贡献:首先,我们提出了一种用于环境声音分类的深层CNN架构。其次,我们提出了使用音频数据增强来克服数据稀缺性的问题,并探讨了不同增强对所提出的CNN体​​系结构性能的影响。结合数据增强,所提出的模型为环境声音分类提供了最新的结果。我们表明,改进的性能源于深度,高容量的模型和增强的训练集的组合:这种组合优于不带增强的拟议CNN和带增强的“浅”字典学习模型。最后,我们检查了每个扩充对每个类别的模型分类准确性的影响,并观察到每个类别的准确性受每个扩充的影响不同,这表明通过应用类别条件数据可以进一步提高模型的性能增强。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号