
End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input



Abstract

Sound event detection systems typically consist of two stages: extracting hand-crafted features from the raw audio waveform, and learning a mapping between these features and the target sound events using a classifier. Recently, the focus of sound event detection research has mostly shifted to the latter stage, with standard features such as the mel spectrogram used as the input for classifiers such as deep neural networks. In this work, we adopt an end-to-end approach and propose combining these two stages in a single deep neural network classifier. Feature extraction over the raw waveform is performed by a feedforward layer block whose parameters are initialized to extract time-frequency representations. The feature extraction parameters are updated during training, resulting in a representation that is optimized for the specific task. This feature extraction block is followed by (and jointly trained with) a convolutional recurrent network, an architecture that has recently given state-of-the-art results in many sound recognition tasks. The proposed system does not outperform a convolutional recurrent network with fixed hand-crafted features. The final magnitude spectrum characteristics of the feature extraction block parameters indicate that the most relevant information for the given task is contained in the 0-3 kHz frequency range, and this is also supported by the empirical results on SED performance.
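The idea of a feedforward layer "initialized to extract the time-frequency representations" can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the frame length, hop size, window, or layer layout, so the values and function names below (`dft_init_weights`, `frame_signal`, `learned_tf_features`) are assumptions chosen for illustration. The sketch only shows that two weight matrices set to the cosine and sine DFT bases reproduce a magnitude spectrogram at initialization; in a trainable framework these matrices would then be updated along with the rest of the network.

```python
import numpy as np

def dft_init_weights(frame_len):
    """Return two (frame_len, n_bins) weight matrices set to the real and
    imaginary DFT bases. Hypothetical helper, not taken from the paper."""
    n_bins = frame_len // 2 + 1
    n = np.arange(frame_len)[:, None]   # sample index within a frame
    k = np.arange(n_bins)[None, :]      # frequency bin index
    angle = 2.0 * np.pi * n * k / frame_len
    return np.cos(angle), -np.sin(angle)

def frame_signal(x, frame_len, hop):
    """Slice a 1-D waveform into overlapping frames of shape (n_frames, frame_len)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def learned_tf_features(x, w_real, w_imag, frame_len, hop):
    """Forward pass of the sketched feature block: window each frame, apply
    two linear maps, and combine them into a magnitude.
    The Hann window is an assumption."""
    frames = frame_signal(x, frame_len, hop) * np.hanning(frame_len)
    re = frames @ w_real
    im = frames @ w_imag
    return np.sqrt(re ** 2 + im ** 2)

# Usage: at initialization the block matches a magnitude spectrogram.
rng = np.random.RandomState(0)
waveform = rng.randn(1000)              # stand-in for raw audio
w_real, w_imag = dft_init_weights(256)
features = learned_tf_features(waveform, w_real, w_imag, frame_len=256, hop=128)
```

Because the weights start from a known transform, training begins from a representation equivalent to the fixed spectrogram rather than from random filters, which is the property the abstract describes.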


Bibliographic details

Source
International Joint Conference on Neural Networks | 2018 | pp. 1-7 | 7 pages
Conference location
Authors
Emre Çakir; Tuomas Virtanen
Author affiliations

Conference organizer
Original format: PDF
Language of text
Chinese Library Classification
Keywords
Feature extraction; Spectrogram; Time-frequency analysis; Neural networks; Task analysis; Training; Convolution


Similar literature


1. Polyphonic Sound Event Detection Based on Residual Convolutional Recurrent Neural Network With Semi-Supervised Loss Function [J]. Nam Kyun Kim, Hong Kook Kim. Quality Control, Transactions. 2021, No. 1
2. Relational recurrent neural networks for polyphonic sound event detection [J]. Ma Junbo, Wang Ruili, Ji Wanting. Multimedia Tools and Applications. 2019, No. 20
3. Seismic Event and Phase Detection Using Time-Frequency Representation and Convolutional Neural Networks [J]. Dokht Ramin M. H., Kao Honn, Visser Ryan. Seismological Research Letters. 2019, No. 2Appa
4. End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input [C]. Emre Cakir, Tuomas Virtanen. International Joint Conference on Neural Networks. 2018
5. Convolutional and recurrent neural networks for pedestrian detection [D]. Balaji, Vivek Arvind. 2016
6. Convolutional Recurrent Neural Network-Based Event Detection in Tunnels Using Multiple Microphones [O]. Nam Kyun Kim, Kwang Myung Jeon, Hong Kook Kim. 2019
7. Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection [O]. Çakır, Emre; Parascandolo, Giambattista; Heittola, Toni. 2017
