首页>
外国专利>
Neural network generative modeling to transform speech utterances and augment training data
Neural network generative modeling to transform speech utterances and augment training data
展开▼
机译:神经网络生成建模转换语音词语和增强训练数据
展开▼
页面导航
摘要
著录项
相似文献
摘要
Systems, methods, and devices for speech transformation and generating synthetic speech using deep generative models are disclosed. A method of the disclosure includes receiving input audio data comprising a plurality of iterations of a speech utterance from a plurality of speakers. The method includes generating an input spectrogram based on the input audio data and transmitting the input spectrogram to a neural network configured to generate an output spectrogram. The method includes receiving the output spectrogram from the neural network and, based on the output spectrogram, generating synthetic audio data comprising the speech utterance.
展开▼