Speech Synthesis of Chinese Braille with Limited Training Data

机译：培训数据有限的中国盲文综合

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes to our knowledge the first Chinese Braille speech synthesis system. The system consists of modules of Braille front-end processing, prosody prediction, and speech synthesis. The Braille front-end processing includes conversion from the common Braille to Pinyin, and a high-precision Chinese character prediction model. To achieve high precision prosody prediction under limited corpus conditions, we propose a prosody prediction model based on the RoBERTa pre-trained model, which achieves an accuracy of 94.42%. Finally, a real-time TTS system based on Tacotron2 and LPCNet is proposed. We modify Tacotron2, including introducing a forward attention mechanism and extending the autoregressive correlation step size to obtain more natural speech.

机译：本文介绍了我们知识的第一款中国盲文语音合成系统。该系统由盲文前端处理，韵律预测和语音合成的模块组成。盲文前端处理包括从普通盲文转换为拼音，以及高精度的汉字预测模型。为了在有限的语料库条件下实现高精度韵律预测，我们提出了一种基于Roberta预训练模型的韵律预测模型，这使得精度为94.42％。最后，提出了一种基于Tacotron2和LPCNet的实时TTS系统。我们修改Tacotron2，包括引入前向关注机制，并扩展自动报告相关步长，以获得更自然的语音。

著录项

来源
《IEEE International Conference on Multimedia and Expo》|2021年|1-6|共6页
会议地点
作者
Jianguo Mao; Jingwen Zhu; Xiangdong Wang; Hong Liu; Yueliang Qian;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Correlation; Conferences; Natural languages; Training data; Predictive models; Real-time systems; Speech synthesis;

机译：相关性;会议;自然语言;培训数据;预测模型;实时系统;语音合成;

相似文献

外文文献
中文文献
专利

1. Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data [J] . Hsin-Min Wang, Tai-Hsuan Ho IEEE Transactions on Speech and Audio Proceeding . 1997,第2期

机译：使用有限的训练数据就可以完全识别具有很大词汇量的连续汉语普通话语音
2. Complete recognition of continuous Mandarin speech for Chineselanguage with very large vocabulary using limited training data [J] . Hsin-Min Wang, Tai-Hsuan Ho, Rung-Chiung Yang, IEEE Transactions on Speech and Audio Proceessing . 1997,第2期

机译：使用有限的培训数据，可以完全识别具有很大词汇量的连续汉语普通话语音
3. Development of training system for people with visual impairment to learn Japanese hand writing guided by Braille graphic display and speech output [J] . Hiroaki Yuze, Chosei Yo, Jun Ishikawa 電子情報通信学会技術研究報告. 教育工学. Educational Technology . 2002,第697期

机译：视力障碍者的盲文图形显示和语音输出指导学习日语手写的训练系统的开发
4. Hierarchical English Emphatic Speech Synthesis Based on HMM with Limited Training Data [C] . Fanbo Meng, Zhiyong Wu, Helen Meng, Annual conference of the International Speech Communication Association . 2012

机译：基于有限训练数据的基于HMM的分级英语口语语音合成
5. Hidden Markov models for visual speech synthesis in limited data environments. [D] . Arb, Harold Allan. 2001

机译：用于有限数据环境中视觉语音合成的隐马尔可夫模型。
6. Limited Pre-Speech Auditory Modulation in Individuals Who Stutter: Data and Hypotheses [O] . Ludo Max, Ayoub Daliri -1

机译：口吃者的有限语音前听觉调节：数据和假设
7. Speech Synthesis of Chinese Braille with Limited Training Data [O] . Jianguo Mao, Jingwen Zhu, Xiangdong Wang, 2021

机译：培训数据有限的中国盲文的语音合成

Speech Synthesis of Chinese Braille with Limited Training Data

摘要

著录项

相似文献

相关主题

期刊订阅