首页> 外文会议>IEEE International Workshop on Machine Learning for Signal Processing >Environmental sound classification with convolutional neural networks
【24h】

Environmental sound classification with convolutional neural networks

机译:卷积神经网络环境声音分类

获取原文

摘要

This paper evaluates the potential of convolutional neural networks in classifying short audio clips of environmental sounds. A deep model consisting of 2 convolutional layers with max-pooling and 2 fully connected layers is trained on a low level representation of audio data (segmented spectrograms) with deltas. The accuracy of the network is evaluated on 3 public datasets of environmental and urban recordings. The model outperforms baseline implementations relying on mel-frequency cepstral coefficients and achieves results comparable to other state-of-the-art approaches.
机译:本文评估了卷积神经网络在分类环境声音的短音频剪辑中的潜力。由具有最大池和2个完全连接的层组成的2个卷积层组成的深层模型,接受了具有增量的音频数据(分段谱图)的低级表示。在3个环境和城市记录的3个公共数据集中评估网络的准确性。该模型优于依赖于熔融频率谱系数的基线实现,实现与其他最先进的方法相当的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号