A low-latency, real-time-capable singing voice detection method with LSTM recurrent neural networks

Abstract

Singing voice detection aims at identifying the regions in a music recording where at least one person sings. This is a challenging problem that cannot be solved without analysing the temporal evolution of the signal. Current state-of-the-art methods combine timbral with temporal characteristics, by summarising various feature values over time, e.g. by computing their variance. This leads to more contextual information, but also to increased latency, which is problematic if our goal is on-line, real-time singing voice detection. To overcome this problem and reduce the necessity to include context in the features themselves, we introduce a method that uses Long Short-Term Memory Recurrent Neural Networks (LSTM-RNN). In experiments on several data sets, the resulting singing voice detector outperforms the state-of-the-art baselines in terms of accuracy, while at the same time drastically reducing latency and increasing the time resolution of the detector.
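
The latency argument in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 41-frame window, the 20 ms hop, the layer sizes, and the random (untrained) weights are all assumptions chosen for illustration. The sketch contrasts the look-ahead required by a centered summary window with a unidirectional LSTM that emits one output per frame using only the current and past frames.

```python
import numpy as np

def lookahead_latency_ms(window_frames, hop_ms):
    """Delay caused by the future half of a centered window of frames."""
    return (window_frames // 2) * hop_ms

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W, U, b):
    """One step of a standard LSTM cell.

    Gate order in the stacked weights: input, forget, output, candidate.
    W has shape (4n, d), U has shape (4n, n), b has shape (4n,).
    """
    n = h.shape[0]
    z = W @ x + U @ h + b
    i, f, o = sigmoid(z[:n]), sigmoid(z[n:2 * n]), sigmoid(z[2 * n:3 * n])
    g = np.tanh(z[3 * n:])
    c = f * c + i * g          # update the cell state
    h = o * np.tanh(c)         # expose the gated cell state
    return h, c

def detect_online(frames, W, U, b, w_out, b_out):
    """Emit one vocal-activity score per frame, using past frames only."""
    n = U.shape[1]
    h, c = np.zeros(n), np.zeros(n)
    probs = np.empty(len(frames))
    for t, x in enumerate(frames):
        h, c = lstm_step(x, h, c, W, U, b)
        probs[t] = sigmoid(w_out @ h + b_out)
    return probs

# Hypothetical sizes and untrained random weights, for illustration only.
rng = np.random.default_rng(0)
n_feat, n_hidden = 30, 16
W = rng.normal(scale=0.1, size=(4 * n_hidden, n_feat))
U = rng.normal(scale=0.1, size=(4 * n_hidden, n_hidden))
b = np.zeros(4 * n_hidden)
w_out = rng.normal(scale=0.1, size=n_hidden)
b_out = 0.0

frames = rng.normal(size=(100, n_feat))   # stand-in for per-frame features
probs = detect_online(frames, W, U, b, w_out, b_out)
```

With these assumed values, `lookahead_latency_ms(41, 20)` gives 400 ms of look-ahead for the centered window, whereas the recurrent loop needs no frames beyond the current one. The scores themselves are meaningless here because the weights are untrained.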

Bibliographic record

Source
European Signal Processing Conference | 2015 | pp. 21-25 | 5 pages
Authors
Lehner Bernhard; Widmer Gerhard; Böck Sebastian
Original format: PDF
Keywords
music information retrieval; recurrent neural nets; singing voice detection;


Similar literature

1. Bidirectional LSTM Malicious webpages detection algorithm based on convolutional neural network and independent recurrent neural network [J] . Wang Huan-huan, Yu Long, Tian Sheng-wei, Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2019, no. 8
2. Multimodal Ambulatory Sleep Detection Using LSTM Recurrent Neural Networks [J] . Akane Sano, Weixuan Chen, Daniel Lopez-Martinez, IEEE Journal of Biomedical and Health Informatics . 2019, no. 4
3. A Gaussian moment method and its augmentation via LSTM recurrent neural networks for the statistics of cavitating bubble populations [J] . Bryngelson Spencer H., Charalampopoulos Alexis, Sapsis Themistoklis P., International Journal of Multiphase Flow . 2020, no. 1
4. A low-latency, real-time-capable singing voice detection method with LSTM recurrent neural networks [C] . Lehner Bernhard, Widmer Gerhard, Böck Sebastian, European Signal Processing Conference . 2015
5. Recurrent Neural Networks and Matrix Methods for Cognitive Radio Spectrum Prediction and Security. [D] . Glandon, Alexander M. 2017
6. LiReD: A Light-Weight Real-Time Fault Detection System for Edge Computing Using LSTM Recurrent Neural Networks [O] . Donghyun Park, Seulgi Kim, Yelin An, 2018
7. A Low-Latency, Real-Time-Capable Singing Voice Detection Method with LSTM Recurrent Neural Networks [O] . Böck Sebastian, Lehner Bernhard, Widmer Gerhard, 2015
