首页> 外文期刊>ETRI journal >Acoustic Event Detection in Multichannel Audio Using Gated Recurrent Neural Networks with High‐Resolution Spectral Features
【24h】

Acoustic Event Detection in Multichannel Audio Using Gated Recurrent Neural Networks with High‐Resolution Spectral Features

机译:使用具有高分辨率频谱特征的门控递归神经网络检测多通道音频中的声音事件

获取原文
获取外文期刊封面目录资料

摘要

Recently, deep recurrent neural networks have achieved great success in various machine learning tasks, and have also been applied for sound event detection. The detection of temporally overlapping sound events in realistic environments is much more challenging than in monophonic detection problems. In this paper, we present an approach to improve the accuracy of polyphonic sound event detection in multichannel audio based on gated recurrent neural networks in combination with auditory spectral features. In the proposed method, human hearing perception‐based spatial and spectral‐domain noise‐reduced harmonic features are extracted from multichannel audio and used as high‐resolution spectral inputs to train gated recurrent neural networks. This provides a fast and stable convergence rate compared to long short‐term memory recurrent neural networks. Our evaluation reveals that the proposed method outperforms the conventional approaches.
机译:近年来,深度递归神经网络在各种机器学习任务中都取得了巨大的成功,并且也已应用于声音事件检测。在现实环境中检测时间重叠的声音事件比单音检测问题更具挑战性。在本文中,我们提出了一种基于门控递归神经网络结合听觉频谱特征来提高多声道音频中复音声音事件检测准确性的方法。在提出的方法中,从多通道音频中提取基于人类听觉感知的空间和频谱域噪声降低的谐波特征,并将其用作高分辨率频谱输入以训练门控递归神经网络。与长短期记忆循环神经网络相比,这提供了快速稳定的收敛速度。我们的评估表明,提出的方法优于传统方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号