SPEECH RECOGNITION USING UNSPOKEN TEXT AND SPEECH SYNTHESIS

Abstract

A method (500) for training a generative adversarial network (GAN)-based text-to-speech (TTS) model (310) and a speech recognition model (200) in unison includes obtaining a plurality of training text utterances (305) and, for each training text utterance, generating, for output by the GAN-based TTS model, a synthetic speech representation (306) of the corresponding training text utterance, and determining, using an adversarial discriminator (318), an adversarial loss term (320) indicative of an amount of acoustic noise disparity in a non-synthetic speech representation (304) relative to the corresponding synthetic speech representation of the corresponding training text utterance. The method also includes updating parameters of the GAN-based TTS model based on the adversarial loss term.
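
The adversarial update described in the abstract can be illustrated with a toy sketch. Nothing below comes from the patent itself: the one-parameter `synthesize` stand-in for the TTS model, the logistic `discriminate` function, the standard GAN loss forms, and the finite-difference update are all assumptions chosen only to keep the example small and runnable. The speech recognition model that is trained in unison is not modeled here.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def synthesize(text, gain):
    # Hypothetical stand-in for the TTS model: one toy "acoustic" feature per character.
    return [gain * (ord(ch) % 7) / 7.0 for ch in text]


def discriminate(features, weight, bias):
    # Hypothetical discriminator: probability that the features are non-synthetic.
    mean = sum(features) / len(features)
    return sigmoid(weight * mean + bias)


def adversarial_loss(real, synthetic, weight, bias):
    # Assumed standard GAN discriminator loss (not specified by the source):
    # low when the discriminator separates real from synthetic features well.
    p_real = discriminate(real, weight, bias)
    p_fake = discriminate(synthetic, weight, bias)
    return -(math.log(p_real) + math.log(1.0 - p_fake))


def generator_loss(text, gain, weight, bias):
    # Assumed non-saturating generator loss: low when synthetic features pass as real.
    return -math.log(discriminate(synthesize(text, gain), weight, bias))


def update_tts(text, gain, weight, bias, lr=0.5, eps=1e-4):
    # One gradient-descent step on the single TTS parameter,
    # with the gradient estimated by a central finite difference.
    grad = (generator_loss(text, gain + eps, weight, bias)
            - generator_loss(text, gain - eps, weight, bias)) / (2 * eps)
    return gain - lr * grad


if __name__ == "__main__":
    real = [0.9, 0.8, 1.0, 0.9, 0.9]  # made-up non-synthetic features
    gain = 1.0
    print("adversarial loss:", adversarial_loss(real, synthesize("hello", gain), 4.0, -2.0))
    print("updated TTS parameter:", update_tts("hello", gain, 4.0, -2.0))
```

In a real system the discriminator and the TTS model would be neural networks updated by backpropagation; the finite-difference step here only makes the direction of the parameter update visible without an autodiff library.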

Bibliographic data

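Publication number: WO2021225829A1

Patent type:
Publication date: 2021-11-11

Original format: PDF
Applicant/Assignee: GOOGLE LLC

Application number: WO2021US29501
Inventors: CHEN ZHEHUAI; ROSENBERG ANDREW; RAMABHADRAN BHUVANA; MORENO MENGIBAR PEDRO J.

Filing date: 2021-04-27
Classification: G10L13
Country: US
Date added to database: 2022-08-24 22:29:23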
