Recurrent neural networks for polyphonic sound event detection in real life recordings

IEEE International Conference on Acoustics, Speech and Signal Processing

Abstract

In this paper we present an approach to polyphonic sound event detection in real life recordings based on bi-directional long short term memory (BLSTM) recurrent neural networks (RNNs). A single multilabel BLSTM RNN is trained to map acoustic features of a mixture signal, consisting of sounds from multiple classes, to binary activity indicators of each event class. Our method is tested on a large database of real-life recordings, with 61 classes (e.g. music, car, speech) from 10 different everyday contexts. The proposed method outperforms previous approaches by a large margin, and the results are further improved using data augmentation techniques. Overall, our system reports an average F1-score of 65.5% on 1-second blocks and 64.7% on single frames, a relative improvement over the previous state-of-the-art approach of 6.8% and 15.1%, respectively.
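The abstract describes a single multilabel BLSTM RNN that maps frame-level acoustic features of a mixture signal to per-class binary activity indicators. The following is a minimal sketch of such a model in PyTorch; the feature dimensionality, hidden size, layer count, and training loop are illustrative assumptions and not the authors' original configuration.

```python
# Hypothetical sketch of a multilabel BLSTM tagger for polyphonic sound event
# detection. Feature type, layer sizes, and training details are assumptions
# chosen for illustration only.
import torch
import torch.nn as nn

NUM_FEATURES = 40   # e.g. mel-band energies per frame (assumed)
NUM_CLASSES = 61    # event classes in the database described in the abstract
HIDDEN_SIZE = 128   # illustrative choice

class BLSTMEventDetector(nn.Module):
    def __init__(self, num_features=NUM_FEATURES, num_classes=NUM_CLASSES,
                 hidden_size=HIDDEN_SIZE, num_layers=2):
        super().__init__()
        # Bi-directional LSTM over the frame sequence of the mixture signal.
        self.blstm = nn.LSTM(input_size=num_features, hidden_size=hidden_size,
                             num_layers=num_layers, batch_first=True,
                             bidirectional=True)
        # One sigmoid output per class and per frame: binary activity indicators.
        self.output = nn.Linear(2 * hidden_size, num_classes)

    def forward(self, features):
        # features: (batch, frames, num_features)
        hidden, _ = self.blstm(features)
        # returns (batch, frames, num_classes) of independent class probabilities
        return torch.sigmoid(self.output(hidden))

model = BLSTMEventDetector()
# Multilabel targets => binary cross-entropy, one term per class and frame.
criterion = nn.BCELoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Dummy batch: 8 clips, 100 frames each, with random multilabel targets.
x = torch.randn(8, 100, NUM_FEATURES)
y = torch.randint(0, 2, (8, 100, NUM_CLASSES)).float()

optimizer.zero_grad()
loss = criterion(model(x), y)
loss.backward()
optimizer.step()
print(f"loss: {loss.item():.4f}")
```

Because the classes are not mutually exclusive in polyphonic audio, each class gets its own sigmoid output and binary cross-entropy term rather than a softmax over classes.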
