首页> 外国专利> TEXT-TO-SPEECH SYNTHESIS METHOD, DEVICE, COMPUTER APPARATUS, AND NON-VOLATILE COMPUTER READABLE STORAGE MEDIUM

TEXT-TO-SPEECH SYNTHESIS METHOD, DEVICE, COMPUTER APPARATUS, AND NON-VOLATILE COMPUTER READABLE STORAGE MEDIUM

机译:文本到语音合成方法,设备,计算机设备和非易失性计算机可读存储介质

摘要

A text-to-speech synthesis method, a device, and a computer apparatus. The text-to-speech synthesis method comprises: first acquiring a target text to be identified (101); performing discrete feature processing on each character in the target text to generate a corresponding feature vector for each character (102); inputting the feature vector into a pre-trained frequency spectrum conversion model, and acquiring a corresponding Mel-spectrum for each character in the target text, the Mel-spectrum being output by the frequency spectrum conversion model (103); and converting the Mel-spectrum into audio data to obtain audio data corresponding to the target text (104). Thus, speech synthesis is performed without generating phonemic notation of each character in a text so as to effectively reduce a workload during a speech synthesis process, provide an effective solution for pronunciation issues during the speech synthesis process, and achieve a wide application range in the field of artificial intelligence.
机译:文本语音合成方法,设备和计算机设备。文本到语音合成方法包括:首先获取要识别的目标文本(101);对目标文本中的每个字符进行离散特征处理,以为每个字符生成对应的特征矢量(102);将特征向量输入预先训练的频谱转换模型中,并为目标文本中的每个字符获取对应的梅尔谱,所述梅尔谱由频谱转换模型输出(103);将Mel频谱转换为音频数据,得到与目标文本对应的音频数据(104)。因此,进行语音合成时不会在文本中生成每个字符的音标,从而有效地减少了语音合成过程中的工作量,为语音合成过程中的发音问题提供了有效的解决方案,并在语音合成中获得了广泛的应用范围。人工智能领域。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号