SPEECH RECOGNITION USING UNSPOKEN TEXT AND SPEECH SYNTHESIS
Abstract
A method (500) for training a generative adversarial network (GAN)-based text-to-speech (TTS) model (310) and a speech recognition model (200) in unison includes obtaining a plurality of training text utterances (305) and, for each training text utterance, generating, for output by the GAN-based TTS model, a synthetic speech representation (306) of the corresponding training text utterance and determining, using an adversarial discriminator (318), an adversarial loss term (320) indicative of an amount of acoustic noise disparity in a non-synthetic speech representation (304) relative to the corresponding synthetic speech representation of the corresponding training text utterance. The method also includes updating parameters of the GAN-based TTS model based on the adversarial loss term.
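The adversarial step described in the abstract can be illustrated with a minimal sketch. Nothing below comes from the patent itself: the functions `synthesize`, `discriminator`, `adversarial_loss`, and `tts_update`, the single trainable parameter `noise_level`, and all numeric values are hypothetical stand-ins. The loss is the standard GAN discriminator loss, assumed here in place of the patent's adversarial loss term, and a finite difference replaces backpropagation. The speech recognition model (200) is not modeled.

```python
import math

def synthesize(text, noise_level):
    """Hypothetical stand-in for the TTS model: one frame per character,
    with `noise_level` as its only trainable parameter."""
    return [noise_level + 0.01 * (ord(ch) % 7) for ch in text]

def discriminator(frames, weight, bias):
    """Toy adversarial discriminator: probability that `frames` is real speech,
    computed as a logistic function of the mean frame value."""
    mean = sum(frames) / len(frames)
    return 1.0 / (1.0 + math.exp(-(weight * mean + bias)))

def adversarial_loss(real_frames, synthetic_frames, weight, bias):
    """Standard GAN discriminator loss (an assumption, not the patent's formula):
    -log D(real) - log(1 - D(synthetic))."""
    d_real = discriminator(real_frames, weight, bias)
    d_fake = discriminator(synthetic_frames, weight, bias)
    return -math.log(d_real) - math.log(1.0 - d_fake)

def tts_update(text, real_frames, noise_level, weight, bias, lr=0.1, eps=1e-4):
    """One TTS parameter update based on the adversarial loss term.
    The TTS model ascends the discriminator's loss (minimax game);
    a central finite difference stands in for backpropagation."""
    def loss_at(p):
        return adversarial_loss(real_frames, synthesize(text, p), weight, bias)
    grad = (loss_at(noise_level + eps) - loss_at(noise_level - eps)) / (2 * eps)
    return noise_level + lr * grad

# Illustrative values only.
text = "hello"                              # a training text utterance
real = [0.50, 0.52, 0.48, 0.51, 0.49]       # a non-synthetic speech representation
w, b = 4.0, -1.0                            # fixed discriminator parameters
before = 0.0
after = tts_update(text, real, before, w, b)

score_before = discriminator(synthesize(text, before), w, b)
score_after = discriminator(synthesize(text, after), w, b)
```

In this toy setup a single update moves the synthetic frames toward the region the discriminator scores as real, so `score_after` exceeds `score_before`. A real system would alternate this with discriminator updates and train the recognizer on both synthetic and non-synthetic speech.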