A Practical Singing Voice Detection System Based on GRU-RNN


Abstract

In this paper, we present a practical three-step approach to singing voice detection based on a gated recurrent unit (GRU) recurrent neural network (RNN); the proposed method achieves results comparable to state-of-the-art methods. We combine four classic features, namely Mel-frequency cepstral coefficients (MFCC), Mel filter bank, linear predictive cepstral coefficients (LPCC), and chroma. The mixed signal is first preprocessed by singing voice separation (SVS) with Deep U-Net convolutional networks. Long short-term memory (LSTM) and GRU were both proposed to solve the vanishing gradient problem in RNNs. In our experiments, we set the block duration to 120 ms and 720 ms respectively, and we obtain results comparable to or better than those of state-of-the-art methods, although the results on Jamendo are not as good as those on RWC-Pop.
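For readers unfamiliar with the GRU named in the abstract, the following is a minimal NumPy sketch of a single GRU layer run over one block of feature frames. It is not the authors' network: the function names (`init_gru`, `gru_step`, `gru_forward`), the random weights, the 20-dimensional feature vectors, the 8 hidden units, and the 12 frames per block are all assumptions chosen for illustration. The update rule follows one common convention for the update gate; some references swap the roles of `z` and `1 - z`.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def init_gru(input_dim, hidden_dim, seed=0):
    """Create small random GRU weights (illustrative values, not trained)."""
    rng = np.random.default_rng(seed)
    params = {}
    for gate in ("z", "r", "h"):
        params["W" + gate] = rng.normal(0.0, 0.1, size=(hidden_dim, input_dim))
        params["U" + gate] = rng.normal(0.0, 0.1, size=(hidden_dim, hidden_dim))
        params["b" + gate] = np.zeros(hidden_dim)
    return params

def gru_step(params, x, h_prev):
    """One GRU time step: update gate z, reset gate r, candidate state."""
    z = sigmoid(params["Wz"] @ x + params["Uz"] @ h_prev + params["bz"])
    r = sigmoid(params["Wr"] @ x + params["Ur"] @ h_prev + params["br"])
    h_cand = np.tanh(params["Wh"] @ x + params["Uh"] @ (r * h_prev) + params["bh"])
    # The gated interpolation lets the state pass through largely unchanged
    # when z is near 0, which is what eases the vanishing gradient problem.
    return (1.0 - z) * h_prev + z * h_cand

def gru_forward(params, frames):
    """Run the GRU over a block of frames and return the final hidden state."""
    h = np.zeros(params["bz"].shape[0])
    for x in frames:
        h = gru_step(params, x, h)
    return h

# Hypothetical block: 12 frames of 20-dimensional features (random stand-ins
# for the combined MFCC / filter bank / LPCC / chroma vector).
frames = np.random.default_rng(1).normal(size=(12, 20))
params = init_gru(input_dim=20, hidden_dim=8)
h = gru_forward(params, frames)
print(h.shape)
```

In a detector, the final hidden state would typically feed a small classifier that labels the block as vocal or non-vocal; that stage is omitted here because the abstract does not describe it.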


Bibliographic details

Source
Conference on Sound and Music Technology | 2018 | pp. 15-25 | 11 pages
Conference location: Xiamen (CN)
Authors
Zhigao Chen; Xulong Zhang; Jin Deng; Juanjuan Li; Yiliang Jiang; Wei Li;
Author affiliations

Department of Computer Science, Fudan University, 201203 Shanghai, China;

Department of Computer Science, Fudan University, 201203 Shanghai, China; Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, 201203 Shanghai, China;

Original format: PDF
Keywords
Singing voice detection (SVD); Gated recurrent unit (GRU); Recurrent neural network (RNN); Music information retrieval (MIR);


