首页> 外国专利> UNIT-SELECTION TEXT-TO-SPEECH SYNTHESIS USING CONCATENATION-SENSITIVE NEURAL NETWORKS

UNIT-SELECTION TEXT-TO-SPEECH SYNTHESIS USING CONCATENATION-SENSITIVE NEURAL NETWORKS

机译：基于连接敏感神经网络的单元选择文本到语音合成

页面导航

摘要
著录项
相似文献

摘要

Systems and processes for performing unit-selection text-to-speech synthesis are provided. In one example process, a sequence of target units can represent a spoken pronunciation of text. A set of predicted acoustic model parameters of a second target unit can be determined using a set of acoustic features of a first candidate speech segment of a first target unit and a set of linguistic features of the second target unit. A likelihood score of the second candidate speech segment with respect to the first candidate speech segment can be determined using the set of predicted acoustic model parameters of the second target unit and a set of acoustic features of the second candidate speech segment of the second target unit. The second candidate speech segment can be selected for speech synthesis based on the determined likelihood score. Speech corresponding to the received text can be generated using the selected second candidate speech segment.

机译：提供了用于执行单元选择文本到语音合成的系统和过程。在一个示例过程中，一系列目标单元可以表示文本的口头发音。可以使用第一目标单元的第一候选语音片段的一组声学特征和第二目标单元的一组语言特征来确定第二目标单元的一组预测声学模型参数。可以使用第二目标单元的预测声学模型参数集和第二目标单元的第二候选语音片段的声学特征集来确定第二候选语音片段相对于第一候选语音片段的似然分数。。可以基于所确定的似然分数来选择第二候选语音片段用于语音合成。可以使用选择的第二候选语音片段来生成与接收到的文本相对应的语音。

著录项

公开/公告号US2017092259A1

专利类型
公开/公告日2017-03-30

原文格式PDF
申请/专利权人 APPLE INC.;
展开▼

申请/专利号US201514961370
发明设计人 WOOJAY JEON;
展开▼

申请日2015-12-07
分类号G10L13/07;G10L13/047;G10L13/08;
国家 US
入库时间 2022-08-21 13:47:01

相似文献

专利
外文文献
中文文献