首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >A novel pitch extraction based on jointly trained deep BLSTM Recurrent Neural Networks with bottleneck features
【24h】

A novel pitch extraction based on jointly trained deep BLSTM Recurrent Neural Networks with bottleneck features

机译:基于联合训练的深蓝色的新型BLSTM复发性神经网络具有瓶颈特征的新型俯仰萃取

获取原文

摘要

Pitch is an important characteristic of speech and is useful for many applications. However, it is still challenging to estimate pitch in strong noise. In this paper, we propose a joint training approach to determinate pitch. First, a Bidirectional Long Short-Term Memory Recurrent Neural Networks (BLSTMRNN) is trained to map the noisy to clean speech features. Second, the pitch estimation is also a BLSTM-RNN model. The feature mapping neural network serves as a noise normalization module aiming at explicitly generating the clean features which are easier to estimate pitch by the following neural network. BLSTM-RNN is trained on sequential frame-level features and capable of learning temporal dynamics. We also propose to take into account bottleneck features for pitch estimation. The experimental results show that the proposed method can obtain accurate pitch estimation and they show good generalization ability to new speakers and noisy conditions. The proposed approach also significantly outperforms other state-of-the-art pitch estimation algorithms.
机译:音高是言语的重要特征,对许多应用有用。然而,估计强大的噪音的音调仍然挑战。在本文中,我们提出了一种联合培训方法来确定音高。首先,培训双向长期内记忆经常性神经网络(BLSTMRNN)以映射嘈杂以清洁语音特征。其次,间距估计也是BLSTM-RNN模型。所述特征映射神经网络作为噪声归一化模块,旨在明确地生成清洁特征,其更容易通过以下的神经网络来估计音调。 BLSTM-RNN培训在顺序帧级别特征上,并能够学习时间动态。我们还建议考虑到音高估计的瓶颈功能。实验结果表明,该方法可以获得准确的音高估计,它们对新扬声器和嘈杂的条件表现出良好的概括能力。所提出的方法也显着优于其他最先进的音高估计算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号