首页> 外国专利> Enhancing hybrid self-attention structure with relative-position-aware bias for speech synthesis

Enhancing hybrid self-attention structure with relative-position-aware bias for speech synthesis

机译:用语音合成的相对位置感知偏差增强混合自我关注结构

摘要

A method of performing speech synthesis, includes encoding character embeddings, using any one or any combination of convolutional neural networks (CNNs) and recurrent neural networks (RNNs), applying a relative-position-aware self attention function to each of the character embeddings and an input mel-scale spectrogram, and encoding the character embeddings to which the relative-position-aware self attention function is applied. The method further includes concatenating the encoded character embeddings and the encoded character embeddings to which the relative-position-aware self attention function is applied, to generate an encoder output, applying a multi-head attention function to the encoder output and the input mel-scale spectrogram to which the relative-position-aware self attention function is applied, and predicting an output mel-scale spectrogram, based on the encoder output and the input mel-scale spectrogram to which the multi-head attention function is applied.
机译:执行语音合成的方法,包括使用卷积神经网络(CNNS)和复发性神经网络(RNN)的任何一个或任何组合,将相对位置感知的自我注意函数应用于每个字符嵌入物和嵌入式的字符嵌入输入熔化谱图,并编码应用相对位置感知自我注意功能的字符嵌入。该方法还包括连接编码字符嵌入和应用相对位置感知自我注意功能的编码字符嵌入,以生成编码器输出,将多针注意功能应用于编码器输出和输入熔体 - 基于编码器输出和应用多头注意功能的输入熔化谱图,应用相对定位自我注意功能的刻度谱图,并预测输出熔化型谱图。

著录项

  • 公开/公告号US11011154B2

    专利类型

  • 公开/公告日2021-05-18

    原文格式PDF

  • 申请/专利权人 TENCENT AMERICA LLC;

    申请/专利号US201916271154

  • 发明设计人 SHAN YANG;HENG LU;SHIYIN KANG;DONG YU;

    申请日2019-02-08

  • 分类号G10L13/047;G06N3/04;G06N3/08;G10L13/07;

  • 国家 US

  • 入库时间 2022-08-24 18:43:09

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号