Unit-selection text-to-speech synthesis based on predicted concatenation parameters

Abstract

Systems and processes for performing unit-selection text-to-speech synthesis are provided. In an example process, text to be converted to speech is received. The text is represented as a sequence of target units. A plurality of candidate speech segments corresponding to the sequence of target units are selected. Predicted statistical parameters of acoustic features associated with the sequence of target units are determined. The predicted statistical parameters of acoustic features are used to determine target costs and concatenation costs associated with the plurality of candidate speech segments. Based on a combined cost determined from the target costs and concatenation costs, a subset of candidate speech segments is selected from the plurality of candidate speech segments. Speech corresponding to the received text is generated using the subset of candidate speech segments.
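The process in the abstract can be illustrated with a small sketch. The abstract does not say how the costs are computed or how the subset is searched, so everything specific here is an assumption made for illustration: a single scalar acoustic feature per segment edge, predicted statistics given as a mean and a variance per target unit, variance-normalized squared distances for both costs, and a Viterbi-style dynamic program for the search. The names `Candidate`, `Predicted`, `target_cost`, `concat_cost`, and `select_units` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Candidate:
    name: str
    start: float  # assumed scalar acoustic feature at the segment's left edge
    end: float    # assumed scalar acoustic feature at the segment's right edge


@dataclass
class Predicted:
    mean: float  # predicted mean of the feature for one target unit
    var: float   # predicted variance of the feature for one target unit


def target_cost(c: Candidate, p: Predicted) -> float:
    """Assumed form: variance-normalized distance of the segment midpoint to the predicted mean."""
    mid = 0.5 * (c.start + c.end)
    return (mid - p.mean) ** 2 / p.var


def concat_cost(prev: Candidate, nxt: Candidate, p: Predicted) -> float:
    """Assumed form: feature jump at the join, scaled by the predicted variance."""
    return (nxt.start - prev.end) ** 2 / p.var


def select_units(
    candidates: Sequence[Sequence[Candidate]],
    predicted: Sequence[Predicted],
    w_concat: float = 1.0,
) -> List[Candidate]:
    """Pick one candidate per target unit, minimizing target + weighted concatenation cost."""
    n = len(candidates)
    best = [[target_cost(c, predicted[0]) for c in candidates[0]]]
    back = [[-1] * len(candidates[0])]
    for t in range(1, n):
        row, ptr = [], []
        for c in candidates[t]:
            # Cost of arriving at c from each candidate of the previous unit.
            costs = [
                best[t - 1][j] + w_concat * concat_cost(p, c, predicted[t])
                for j, p in enumerate(candidates[t - 1])
            ]
            j_min = min(range(len(costs)), key=costs.__getitem__)
            row.append(costs[j_min] + target_cost(c, predicted[t]))
            ptr.append(j_min)
        best.append(row)
        back.append(ptr)
    # Trace back from the cheapest final candidate.
    k = min(range(len(best[-1])), key=best[-1].__getitem__)
    path = []
    for t in range(n - 1, -1, -1):
        path.append(candidates[t][k])
        k = back[t][k]
    return path[::-1]


# Made-up illustration data: two target units, two candidates each.
predicted = [Predicted(mean=100.0, var=25.0), Predicted(mean=110.0, var=25.0)]
candidates = [
    [Candidate("a1", 100.0, 100.0), Candidate("a2", 99.0, 109.0)],
    [Candidate("b1", 110.0, 110.0), Candidate("b2", 115.0, 115.0)],
]
chosen = [c.name for c in select_units(candidates, predicted)]
```

With these made-up numbers, `a1` has the lowest target cost for the first unit, but the combined cost favors `a2` followed by `b1` because that pair joins more smoothly. Setting `w_concat=0.0` ignores the joins and selects `a1` instead, which shows why the abstract combines both cost types before choosing the subset.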
