Chinese Speech Synthesis System Based on End to End



Abstract

This paper describes an end-to-end Chinese speech synthesis scheme: a neural network architecture, built on an improved Tacotron 2, that synthesizes speech directly from text. To improve the original seq2seq model, a multi-attention mechanism replaces the original position-based attention mechanism, which improves the sound quality of the synthesized speech. A prosodic style coding module is added so that the synthesized speech better conveys prosodic and stylistic characteristics. An LPCNet vocoder then replaces the original WaveNet vocoder to generate the time-domain waveform, which improves synthesis efficiency and reduces synthesis time. Experiments show that the system is effective and fast, and that the synthesized speech is natural and has good prosodic style.
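
The abstract does not say what form the multi-attention mechanism takes. As a rough illustration only, the sketch below assumes it resembles standard multi-head scaled dot-product attention between decoder states and encoder outputs. The function name `multi_head_attention`, the weight matrices, and all dimensions are hypothetical and are not taken from the paper.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attention(query, memory, w_q, w_k, w_v, num_heads):
    """Assumed form of multi-head attention (not the paper's exact module).

    query:  (T_dec, d_model) decoder states
    memory: (T_enc, d_model) encoder outputs
    Returns the context (T_dec, d_model) and the attention weights
    (num_heads, T_dec, T_enc). The usual output projection is omitted.
    """
    t_dec, d_model = query.shape
    t_enc = memory.shape[0]
    d_head = d_model // num_heads

    def split_heads(x, length):
        # (length, d_model) -> (num_heads, length, d_head)
        return x.reshape(length, num_heads, d_head).transpose(1, 0, 2)

    q = split_heads(query @ w_q, t_dec)
    k = split_heads(memory @ w_k, t_enc)
    v = split_heads(memory @ w_v, t_enc)

    # Each head scores every decoder step against every encoder step.
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)
    weights = softmax(scores, axis=-1)

    # Weighted sum of values per head, then concatenate the heads.
    context = (weights @ v).transpose(1, 0, 2).reshape(t_dec, d_model)
    return context, weights

# Toy dimensions chosen only for illustration.
rng = np.random.default_rng(0)
d_model, num_heads = 8, 2
encoder_out = rng.normal(size=(5, d_model))    # 5 input tokens
decoder_state = rng.normal(size=(3, d_model))  # 3 decoder steps
w_q, w_k, w_v = (rng.normal(size=(d_model, d_model)) for _ in range(3))

context, weights = multi_head_attention(
    decoder_state, encoder_out, w_q, w_k, w_v, num_heads
)
print(context.shape, weights.shape)
```

Each head produces its own alignment between decoder steps and input tokens, so different heads can attend to different parts of the text at the same step.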


Bibliographic Information

Source
IEEE International Conference on Software Engineering and Service Science | 2020 | pp. 273-277 | 5 pages
Authors
Hao Zhang; Xiaojun Huang
Original format: PDF
Keywords
tacotron2; Chinese speech synthesis; multi attention mechanism; prosodic coding; LpcNet


