IEEE International Conference on Multimedia and Expo

Speech Synthesis of Chinese Braille with Limited Training Data

Abstract

This paper describes what is, to our knowledge, the first Chinese Braille speech synthesis system. The system consists of three modules: Braille front-end processing, prosody prediction, and speech synthesis. The Braille front end includes conversion from common Braille to Pinyin and a high-precision Chinese character prediction model. To achieve high-precision prosody prediction with a limited corpus, we propose a prosody prediction model based on the pre-trained RoBERTa model, which achieves an accuracy of 94.42%. Finally, we propose a real-time TTS system based on Tacotron2 and LPCNet. We modify Tacotron2 by introducing a forward attention mechanism and extending the autoregressive correlation step size to obtain more natural speech.
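
The abstract does not say which variant of forward attention is added to Tacotron2. As a rough illustration only, the sketch below implements the commonly cited forward attention recursion, in which the alignment at each decoder step may either stay on the same input position or advance by one. The function names `initial_alpha` and `forward_attention_step` are hypothetical and are not taken from the paper, and plain Python lists stand in for tensors.

```python
from typing import List


def initial_alpha(num_inputs: int) -> List[float]:
    """Start with all attention mass on the first input position (assumed convention)."""
    if num_inputs < 1:
        raise ValueError("num_inputs must be at least 1")
    return [1.0] + [0.0] * (num_inputs - 1)


def forward_attention_step(prev_alpha: List[float], scores: List[float]) -> List[float]:
    """One decoder step of the forward attention recursion.

    prev_alpha: forward attention weights from the previous decoder step.
    scores: ordinary normalized attention weights for the current step.
    Each position n receives mass only from position n (stay) or n - 1 (advance),
    which keeps the alignment monotonic.
    """
    if len(prev_alpha) != len(scores):
        raise ValueError("prev_alpha and scores must have the same length")
    unnormalized = []
    for n, score in enumerate(scores):
        stay = prev_alpha[n]
        advance = prev_alpha[n - 1] if n > 0 else 0.0
        unnormalized.append((stay + advance) * score)
    total = sum(unnormalized)
    if total == 0.0:
        # Degenerate case: keep the previous alignment rather than divide by zero.
        return list(prev_alpha)
    return [value / total for value in unnormalized]


if __name__ == "__main__":
    alpha = initial_alpha(3)
    for step_scores in ([1 / 3, 1 / 3, 1 / 3], [0.2, 0.3, 0.5]):
        alpha = forward_attention_step(alpha, step_scores)
        print([round(a, 3) for a in alpha])
```

Because mass can move forward by at most one position per step, the third input position cannot receive any weight until the second decoder step, which is the kind of constraint that discourages skipped or repeated segments in synthesized speech.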