首页> 外文会议>International conference on multimedia modeling >Multi-channel Convolutional Neural Networks with Multi-level Feature Fusion for Environmental Sound Classification
【24h】

Multi-channel Convolutional Neural Networks with Multi-level Feature Fusion for Environmental Sound Classification

机译:具有多级特征融合的多通道卷积神经网络用于环境声分类

获取原文
获取外文期刊封面目录资料

摘要

Learning acoustic models directly from the raw waveform is an effective method for Environmental Sound Classification (ESC) where sound events often exhibit vast diversity in temporal scales. Convolutional neural networks (CNNs) based ESC methods have achieved the state-of-the-art results. However, their performance is affected significantly by the number of convolutional layers used and the choice of the kernel size in the first convolutional layer. In addition, most existing studies have ignored the ability of CNNs to learn hierarchical features from environmental sounds. Motivated by these findings, in this paper, parallel convolutional filters with different sizes in the first convolutional layer are designed to extract multi-time resolution features aiming at enhancing feature representation. Inspired by VGG networks, we build our deep CNNs by stacking 1-D convolutional layers using very small filters except for the first layer. Finally, we extend the model using multi-level feature aggregation technique to boost the classification performance. The experimental results on Urbansound 8k, ESC-50, and ESC-10 show that our proposed method outperforms the state-of-the-art end-to-end methods for environmental sound classification in terms of the classification accuracy.
机译:直接从原始波形学习声学模型是一种有效的环境声音分类(ESC)方法,其中声音事件通常在时间尺度上表现出极大的多样性。基于卷积神经网络(CNN)的ESC方法已经达到了最新水平。但是,它们的性能会受到所使用的卷积层数和第一卷积层中内核大小的选择的显着影响。此外,大多数现有研究都忽略了CNN从环境声音中学习层次特征的能力。基于这些发现,本文在第一卷积层中设计了大小不同的并行卷积滤波器,以提取旨在增强特征表示的多次分辨率特征。受VGG网络的启发,我们通过使用除第一层以外的非常小的滤镜堆叠一维卷积层来构建深层的CNN。最后,我们使用多级特征聚合技术扩展模型以提高分类性能。在Urbansound 8k,ESC-50和ESC-10上的实验结果表明,在分类准确度方面,我们提出的方法优于最新的端到端环境声音分类方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号