首页>
外国专利>
UNIT-SELECTION TEXT-TO-SPEECH SYNTHESIS USING CONCATENATION-SENSITIVE NEURAL NETWORKS
UNIT-SELECTION TEXT-TO-SPEECH SYNTHESIS USING CONCATENATION-SENSITIVE NEURAL NETWORKS
展开▼
机译:基于连接敏感神经网络的单元选择文本到语音合成
展开▼
页面导航
摘要
著录项
相似文献
摘要
Systems and processes for performing unit-selection text-to-speech synthesis are provided. In one example process, a sequence of target units can represent a spoken pronunciation of text. A set of predicted acoustic model parameters of a second target unit can be determined using a set of acoustic features of a first candidate speech segment of a first target unit and a set of linguistic features of the second target unit. A likelihood score of the second candidate speech segment with respect to the first candidate speech segment can be determined using the set of predicted acoustic model parameters of the second target unit and a set of acoustic features of the second candidate speech segment of the second target unit. The second candidate speech segment can be selected for speech synthesis based on the determined likelihood score. Speech corresponding to the received text can be generated using the selected second candidate speech segment.
展开▼