首页> 外文会议>Conference on sound and music technology >A Practical Singing Voice Detection System Based on GRU-RNN
【24h】

A Practical Singing Voice Detection System Based on GRU-RNN

机译:基于GRU-RNN的实用唱歌语音检测系统

获取原文

摘要

In this paper, we present a practical three-step approach for singing voice detection based on a gated recurrent unit (GRU) recurrent neural network (RNN) and the proposed method achieves comparable results to state-of-the-art method. We combine four classic features—namely Mel-frequency Cepstral Coefficients (MFCC), Mel-filter Bank, Linear Predictive Cepstral Coefficients (LPCC), and Chroma. Then, the mixed signal is first preprocessed by singing voice separation (SVS) with the Deep U-Net Convolutional Networks. Long short-term memory (LSTM) and GRU are both proposed to solve the gradient vanish problem in RNN. In our experiments, we set the block duration as 120 ms and 720 ms respectively, and we get comparable or better results than results from state-of-the-art methods, while results on Jamendo are not as good as those from RWC-Pop.
机译:在本文中,我们提出了一种基于门控递归单元(GRU)递归神经网络(RNN)的实用的三步歌唱语音检测方法,该方法取得了与最新方法相当的结果。我们结合了四个经典功能-即梅尔频率倒谱系数(MFCC),梅尔滤波器组,线性预测倒谱系数(LPCC)和色度。然后,首先使用Deep U-Net卷积网络通过唱歌语音分离(SVS)对混合信号进行预处理。提出了长短期记忆(LSTM)和GRU来解决RNN中的梯度消失问题。在我们的实验中,我们将块持续时间分别设置为120 ms和720 ms,我们得到的结果与最新方法的结果可比或更好,而Jamendo的结果不如RWC-Pop的结果好。 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号