首页> 外国专利> METHOD AND APPARATUS, MEDIUM, AND DEVICE FOR SPEECH SYNTHESIS BASED ON PROSODIC BOUNDARY

METHOD AND APPARATUS, MEDIUM, AND DEVICE FOR SPEECH SYNTHESIS BASED ON PROSODIC BOUNDARY

机译：基于韵律边界的语音合成方法和装置，介质和装置

页面导航

摘要
著录项
相似文献

摘要

Provided are a method and apparatus, medium, and device for speech synthesis based on a prosodic boundary, said method comprising: obtaining prosodic boundary information of text information to be synthesized, and generating image embedded information on the basis of the prosodic boundary information (S102); generating a hidden state vector of the image embedded information and the sequence coding of the text information to be synthesized (S104); generating a speech spectrum on the basis of the hidden state vector and sequence coding (S106); according to the speech spectrum, synthesizing the speech information of the text information to be synthesized (S108). The semantic and grammatical structure of a sentence can be analyzed from the text side, and the prosodic boundary is represented by means of image embedding, such that the prosodic information in the text can be fully involved in training and reasoning, improving the sense of prosody of the synthesized speech information. The invention also relates to blockchain technology; the hidden state vector and the sequence coding of the text information to be synthesized are stored in the blockchain, thus improving the security of data storage.

机译：提供了一种基于韵律边界的语音合成的方法和装置，介质和装置，所述方法包括：获得要合成文本信息的韵律边界信息，并基于韵律边界信息生成图像嵌入信息（S102 ）;生成图像嵌入信息的隐藏状态向量和要合成的文本信息的序列编码（S104）;基于隐藏状态向量和序列编码生成语音频谱（S106）;根据语音频谱，合成要合成的文本信息的语音信息（S108）。句子的语义和语法结构可以从文本侧分析，并且韵律边界通过嵌入的方式表示，使文本中的韵律信息可以完全涉及培训和推理，从而提高韵律的感觉合成的语音信息。本发明还涉及区间结构;隐藏状态向量和要合成的文本信息的序列编码存储在区块链中，从而提高了数据存储的安全性。

著录项

公开/公告号WO2021174874A1

专利类型
公开/公告日2021-09-10

原文格式PDF
申请/专利权人 PING AN TECHNOLOGY (SHENZHEN) CO. LTD.;
展开▼

申请/专利号WO2020CN124257
发明设计人 SUN AOLAN;WANG JIANZONG;CHENG NING;
展开▼

申请日2020-10-28
分类号G10L13/10;G10L13/027;
国家 CN
入库时间 2022-08-24 20:59:23

相似文献

专利
外文文献
中文文献