首页> 外国专利> HYPER-STRUCTURE RECURRENT NEURAL NETWORKS FOR TEXT-TO-SPEECH

HYPER-STRUCTURE RECURRENT NEURAL NETWORKS FOR TEXT-TO-SPEECH

机译:文本到语音的超结构递归神经网络

摘要

The technology relates to converting text to speech utilizing recurrent neural networks (RNNs). The recurrent neural networks may be implemented as multiple modules for determining properties of the text. In embodiments, a part-of-speech RNN module, letter-to-sound RNN module, a linguistic prosody tagger RNN module, and a context awareness and semantic mining RNN module may all be utilized. The properties from the RNN modules are processed by a hyper-structure RNN module that determine the phonetic properties of the input text based on the outputs of the other RNN modules. The hyper-structure RNN module may generate a generation sequence that is capable of being converting to audible speech by a speech synthesizer. The generation sequence may also be optimized by a global optimization module prior to being synthesized into audible speech.
机译:该技术涉及利用递归神经网络(RNN)将文本转换为语音。递归神经网络可以被实现为用于确定文本的属性的多个模块。在实施例中,可以全部使用词性RNN模块,字母转RNN模块,语言韵律标记器RNN模块以及上下文感知和语义挖掘RNN模块。 RNN模块的属性由超结构RNN模块处理,该模块根据其他RNN模块的输出确定输入文本的语音属性。超结构RNN模块可以生成能够被语音合成器转换为可听语音的生成序列。在被合成为可听语音之前,生成序列还可以由全局优化模块来优化。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号