
SPEECH RECOGNITION USING UNSPOKEN TEXT AND SPEECH SYNTHESIS


Abstract

A method (500) for training a generative adversarial network (GAN)-based text-to-speech (TTS) model (310) and a speech recognition model (200) in unison includes obtaining a plurality of training text utterances (305) and, for each training text utterance, generating, for output by the GAN-based TTS model, a synthetic speech representation (306) of the corresponding training text utterance, and determining, using an adversarial discriminator (318), an adversarial loss term (320) indicative of an amount of acoustic noise disparity in a non-synthetic speech representation (304) relative to the corresponding synthetic speech representation of the corresponding training text utterance. The method also includes updating parameters of the GAN-based TTS model based on the adversarial loss term.
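The adversarial step described in the abstract can be illustrated with a toy sketch. It is not the patented implementation. Everything in it is an assumption made for illustration: the names `synthesize`, `discriminator`, `adversarial_loss`, and `generator_step`, the single scalar TTS parameter `gain`, the logistic discriminator, and the standard GAN cross-entropy loss, which the abstract does not specify. The speech recognition model (200) is left out.

```python
import math

EPS = 1e-12  # guards against log(0)

def synthesize(text, gain):
    """Hypothetical stand-in for the TTS model (310): one toy feature per character."""
    return [gain * (ord(c) % 7) / 7.0 for c in text]

def discriminator(features, weight, bias):
    """Hypothetical discriminator (318): probability that features are non-synthetic."""
    mean = sum(features) / len(features)
    return 1.0 / (1.0 + math.exp(-(weight * mean + bias)))

def adversarial_loss(real, synthetic, weight, bias):
    """Standard GAN discriminator loss; an assumed form of the loss term (320)."""
    d_real = discriminator(real, weight, bias)
    d_fake = discriminator(synthetic, weight, bias)
    return -(math.log(d_real + EPS) + math.log(1.0 - d_fake + EPS))

def generator_loss(text, gain, weight, bias):
    """Non-saturating generator loss: low when synthetic speech fools the discriminator."""
    return -math.log(discriminator(synthesize(text, gain), weight, bias) + EPS)

def generator_step(text, gain, weight, bias, lr=0.1, h=1e-5):
    """One gradient step on the toy TTS parameter, using a central finite difference."""
    grad = (generator_loss(text, gain + h, weight, bias)
            - generator_loss(text, gain - h, weight, bias)) / (2 * h)
    return gain - lr * grad

if __name__ == "__main__":
    text, gain, weight, bias = "hello", 0.5, 1.0, 0.0
    real = [0.9, 0.8, 1.0]  # made-up non-synthetic features (304)
    print("adversarial loss:", adversarial_loss(real, synthesize(text, gain), weight, bias))
    print("updated gain:", generator_step(text, gain, weight, bias))
```

In a real system the generator and discriminator would be neural networks trained with automatic differentiation. The finite-difference step is used here only to keep the sketch dependency-free.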
