首页> 外国专利> Unit-selection text-to-speech synthesis based on predicted concatenation parameters

Unit-selection text-to-speech synthesis based on predicted concatenation parameters

机译：基于预测级联参数的单元选择文本到语音合成

页面导航

摘要
著录项
相似文献

摘要

Systems and processes for performing unit-selection text-to-speech synthesis are provided. In an example process, text to be converted to speech is received. The text is represented as a sequence of target units. A plurality of candidate speech segments corresponding to the sequence of target units are selected. Predicted statistical parameters of acoustic features associated with the sequence of target units are determined. The predicted statistical parameters of acoustic features are used to determine target costs and concatenation costs associated with the plurality of candidate speech segments. Based on a combined cost determined from the target costs and concatenation costs, a subset of candidate speech segments is selected from the plurality of candidate speech segments. Speech corresponding to the received text is generated using the subset of candidate speech segments.

机译：提供了用于执行单元选择文本到语音合成的系统和过程。在示例过程中，接收要转换为语音的文本。文本表示为目标单元的序列。选择与目标单元的序列相对应的多个候选语音片段。确定与目标单元序列相关联的声学特征的预测统计参数。声学特征的预测统计参数用于确定与多个候选语音片段相关联的目标成本和串联成本。基于从目标成本和串联成本确定的组合成本，从多个候选语音片段中选择候选语音片段的子集。使用候选语音片段的子集生成与接收到的文本相对应的语音。

著录项

公开/公告号US9934775B2

专利类型
公开/公告日2018-04-03

原文格式PDF
申请/专利权人 APPLE INC.;
展开▼

申请/专利号US201615266930
发明设计人 TUOMO J. RAITIO;KISHORE SUNKESWARI PRAHALLAD;ALISTAIR D. CONKIE;LADAN GOLIPOUR;DAVID A. WINARSKY;
展开▼

申请日2016-09-15
分类号G10L13/10;G10L13/033;G10L13/06;
国家 US
入库时间 2022-08-21 12:55:26

相似文献

专利
外文文献
中文文献