首页> 外文期刊>IEEE signal processing letters >DBN-based Spectral Feature Representation for Statistical Parametric Speech Synthesis
【24h】

DBN-based Spectral Feature Representation for Statistical Parametric Speech Synthesis

机译:统计参数语音合成的基于DBN的频谱特征表示

获取原文
获取原文并翻译 | 示例
           

摘要

This letter presents a method of deriving spectral features using a deep belief network (DBN) for hidden Markov model (HMM)-based parametric speech synthesis. At training time, a DBN is estimated to represent the high-dimensional spectral envelopes and then transforms them into binary codes. These DBN-based binary codes (DBCs) are used as spectral features for HMM modeling. At synthesis time, spectral envelopes are recovered from the predicted DBC sequences and then used for waveform reconstruction. Experimental results show that our proposed method can achieve better naturalness than the conventional method using mel-cepstra as spectral features and considering global variance (GV) during parameter generation.
机译:这封信提出了一种使用深度置信网络(DBN)导出频谱特征的方法,用于基于隐马尔可夫模型(HMM)的参量语音合成。在训练时,估计一个DBN代表高维频谱包络,然后将它们转换为二进制代码。这些基于DBN的二进制代码(DBC)用作HMM建模的频谱特征。在合成时,从预测的DBC序列中恢复频谱包络,然后将其用于波形重建。实验结果表明,与以mel-cepstra为谱特征并在参数生成过程中考虑全局方差(GV)的常规方法相比,我们提出的方法具有更好的自然性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号