首页> 外国专利> HYPER-STRUCTURE RECURRENT NEURAL NETWORKS FOR TEXT-TO-SPEECH

HYPER-STRUCTURE RECURRENT NEURAL NETWORKS FOR TEXT-TO-SPEECH

机译：文本到语音的超结构递归神经网络

页面导航

摘要
著录项
相似文献

摘要

The technology relates to converting text to speech utilizing recurrent neural networks (RNNs). The recurrent neural networks may be implemented as multiple modules for determining properties of the text. In embodiments, a part-of-speech RNN module, letter-to-sound RNN module, a linguistic prosody tagger RNN module, and a context awareness and semantic mining RNN module may all be utilized. The properties from the RNN modules are processed by a hyper-structure RNN module that determine the phonetic properties of the input text based on the outputs of the other RNN modules. The hyper-structure RNN module may generate a generation sequence that is capable of being converting to audible speech by a speech synthesizer. The generation sequence may also be optimized by a global optimization module prior to being synthesized into audible speech.

机译：该技术涉及利用递归神经网络（RNN）将文本转换为语音。递归神经网络可以被实现为用于确定文本的属性的多个模块。在实施例中，可以全部使用词性RNN模块，字母转RNN模块，语言韵律标记器RNN模块以及上下文感知和语义挖掘RNN模块。 RNN模块的属性由超结构RNN模块处理，该模块根据其他RNN模块的输出确定输入文本的语音属性。超结构RNN模块可以生成能够被语音合成器转换为可听语音的生成序列。在被合成为可听语音之前，生成序列还可以由全局优化模块来优化。

著录项

公开/公告号EP3155613A1

专利类型
公开/公告日2017-04-19

原文格式PDF
申请/专利权人 MICROSOFT TECHNOLOGY LICENSING LLC;
展开▼

申请/专利号EP20150738162
发明设计人 ZHAO PEI;LEUNG MAX;YAO KAISHENG;YAN BO;ZHAO SHENG;ALLEVA FILENO A.;
展开▼

申请日2015-06-12
分类号G10L13/10;
国家 EP
入库时间 2022-08-21 14:04:25

相似文献

专利
外文文献
中文文献