...
首页> 外文期刊>Intelligent decision technologies >Statistical parametric speech synthesis with a novel codebook-based excitation model
【24h】

Statistical parametric speech synthesis with a novel codebook-based excitation model

机译:统计参数语音合成与基于新型密码本的激励模型

获取原文
获取原文并翻译 | 示例

摘要

Speech synthesis is an important modality in Cognitive Infocommunications, which is the intersection of informatics and cognitive sciences. Statistical parametric methods have gained importance in speech synthesis recently. The speech signal is decomposed to parameters and later restored from them. The decomposition is implemented by speech coders. We apply a novel codebook-based speech coding method to model the excitation of speech. In the analysis stage the speech signal is analyzed frame-by-frame and a codebook of pitch synchronous excitations is built from the voiced parts. Timing, gain and harmonic-to-noise ratio parameters are extracted and fed into the machine learning stage of Hidden Markov-model based speech synthesis. During the synthesis stage the codebook is searched for a suitable element in each voiced frame and these are concatenated to create the excitation signal, from which the final synthesized speech is created. Our initial experiments show that the model fits well in the statistical parametric speech synthesis framework and in most cases it can synthesize speech in a better quality than the traditional pulse-noise excitation. (This paper is an extended version of [10].)
机译:语音合成是认知信息通信中的一种重要形式,它是信息学与认知科学的交集。统计参数方法最近在语音合成中变得越来越重要。语音信号被分解为参数,然后从中恢复。分解由语音编码器实现。我们应用一种新颖的基于密码本的语音编码方法来对语音激励进行建模。在分析阶段,对语音信号进行逐帧分析,并从发声部分构建音高同步激励的码本。提取时间,增益和谐波噪声比参数,并将其输入到基于隐马尔可夫模型的语音合成的机器学习阶段。在合成阶段,在每个有声帧中搜索码本以寻找合适的元素,然后将它们级联以创建激励信号,从中创建最终的合成语音。我们的初步实验表明,该模型非常适合统计参数语音合成框架,并且在大多数情况下,与传统的脉冲噪声激励相比,它可以以更好的质量合成语音。 (本文是[10]的扩展版本。)

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号