
Employing speech models in concatenative speech synthesis


Abstract

A text-to-speech synthesizer employs a database of units. For each unit, the database holds a collection of unit selection parameters and a plurality of frames. Each frame has a set of model parameters derived from a base speech frame, and a speech frame synthesized from those model parameters. The text to be synthesized is converted to a sequence of desired unit feature sets, and for each such set the database is searched to retrieve the best-matching unit. An assessment is then made of whether the frames need to be modified, either because of discontinuities in the model parameters at unit boundaries or because of differences between the desired and the selected unit features. When modifications are necessary, the model parameters of the frames that need to be altered are modified, and new frames are synthesized from the modified model parameters and concatenated to the output. Otherwise, the speech frames previously stored in the database are retrieved and concatenated to the output.
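
The control flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patent's implementation: the names `Frame`, `Unit`, `select_unit`, `synthesize`, `render`, `adapt`, `feature_tol`, and `join_tol` are all hypothetical, and the Euclidean distance, the fixed thresholds, and the midpoint smoothing at unit boundaries are assumptions chosen only to make the example concrete.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence

Vector = List[float]


@dataclass
class Frame:
    params: Vector   # model parameters derived from the base speech frame
    samples: Vector  # speech frame pre-synthesized from params and stored


@dataclass
class Unit:
    features: Vector    # unit selection parameters
    frames: List[Frame]


def distance(a: Sequence[float], b: Sequence[float]) -> float:
    # Assumption: plain Euclidean distance stands in for the real cost.
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def select_unit(database: Sequence[Unit], target: Vector) -> Unit:
    # Retrieve the unit whose selection parameters best match the target.
    return min(database, key=lambda unit: distance(unit.features, target))


def synthesize(
    database: Sequence[Unit],
    targets: Sequence[Vector],
    render: Callable[[Vector], Vector],                  # hypothetical model synthesizer
    adapt: Callable[[Vector, Vector, Vector], Vector],   # hypothetical parameter modifier
    feature_tol: float,
    join_tol: float,
) -> Vector:
    output: Vector = []
    prev: Optional[Vector] = None  # model parameters of the last emitted frame
    for target in targets:
        unit = select_unit(database, target)
        # Modification is needed when the selected unit differs from the target.
        mismatch = distance(unit.features, target) > feature_tol
        for index, frame in enumerate(unit.frames):
            params, changed = frame.params, False
            if mismatch:
                params, changed = adapt(frame.params, unit.features, target), True
            # Modification is also needed when parameters jump at a unit boundary.
            if index == 0 and prev is not None and distance(prev, params) > join_tol:
                # Assumption: smooth the join by moving to the midpoint.
                params, changed = [(p + q) / 2 for p, q in zip(prev, params)], True
            # Re-synthesize only modified frames; otherwise reuse the stored frame.
            output.extend(render(params) if changed else frame.samples)
            prev = params
    return output
```

The point of the design is in the last branch: frames whose parameters were left untouched are copied from the database as stored, so the cost of model-based synthesis is paid only for frames that were actually modified.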
