首页> 外文会议>Annual Conference of the International Speech Communication Association >Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks
【24h】

Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks

机译:使用深频神经网络的噪声强大的文本与语音合成系统的语音增强

获取原文

摘要

Quality of text-to-speech voices built from noisy recordings is diminished. In order to improve it we propose the use of a recurrent neural network to enhance acoustic parameters prior to training. We trained a deep recurrent neural network using a parallel database of noisy and clean acoustics parameters as input and output of the network. The database consisted of multiple speakers and diverse noise conditions. We investigated using text-derived features as an additional input of the network. We processed a noisy database of two other speakers using this network and used its output to train an HMM acoustic text-to-synthesis model for each voice. Listening experiment results showed that the voice built with enhanced parameters was ranked significantly higher than the ones trained with noisy speech and speech that has been enhanced using a conventional enhancement system. The text-derived features improved results only for the female voice, where it was ranked as highly as a voice trained with clean speech.
机译:从嘈杂录音中建造的文本语音声音的质量减少。为了改善它,我们提出了经常性神经网络在训练前增强声学参数。我们使用并行数据库训练了一个深度经常性的神经网络,并将声学参数的并行数据库作为网络的输入和输出。数据库包括多个扬声器和不同的噪声条件。我们使用文本派生功能作为网络的附加输入进行了调查。我们使用此网络处理了另外两个扬声器的嘈杂数据库,并使用其输出来为每个语音训练HMM声学文本到合成模型。聆听实验结果表明,通过增强参数构建的语音被排名得明显高于使用传统增强系统进行增强的噪音和语音训练的音乐。文本派生的功能仅为女性语音改进了结果,在那里它被排名第一,作为用干净的语音训练的声音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号