首页> 外文会议>Chinese conference pattern recognition and computer vision >Deep Convolutional Neural Network with Mixup for Environmental Sound Classification
【24h】

Deep Convolutional Neural Network with Mixup for Environmental Sound Classification

机译:深度混合的深度卷积神经网络用于环境声音分类

获取原文

摘要

Environmental sound classification (ESC) is an important and challenging problem. In contrast to speech, sound events have noiselike nature and may be produced by a wide variety of sources. In this paper, we propose to use a novel deep convolutional neural network for ESC tasks. Our network architecture uses stacked convolutional and pooling layers to extract high-level feature representations from spectrogram-like features. Furthermore, we apply mixup to ESC tasks and explore its impacts on classification performance and feature distribution. Experiments were conducted on UrbanSound8K, ESC-50 and ESC-10 datasets. Our experimental results demonstrated that our ESC system has achieved the state-of-the-art performance (83.7%) on UrbanSound8K and competitive performance on ESC-50 and ESC-10.
机译:环境声音分类(ESC)是一个重要且具有挑战性的问题。与语音相反,声音事件具有类似噪声的性质,并且可能由多种来源产生。在本文中,我们建议使用新颖的深度卷积神经网络执行ESC任务。我们的网络体系结构使用堆叠的卷积和池化层从类似频谱图的特征中提取高级特征表示。此外,我们将混合应用于ESC任务,并探索其对分类性能和特征分布的影响。在UrbanSound8K,ESC-50和ESC-10数据集上进行了实验。我们的实验结果表明,我们的ESC系统在UrbanSound8K上达到了最先进的性能(83.7%),在ESC-50和ESC-10上也具有竞争优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号