MULTI-DISTRIBUTION DEEP BELIEF NETWORK FOR SPEECH SYNTHESIS

机译：语音合成多分布深度信仰网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep belief network (DBN) has been shown to be a good generative model in tasks such as hand-written digit image generation. Previous work on DBN in the speech community mainly focuses on using the generatively pre-trained DBN to initialize a discriminative model for better acoustic modeling in speech recognition (SR). To fully utilize its generative nature, we propose to model the speech parameters including spectrum and FO simultaneously and generate these parameters from DBN for speech synthesis. Compared with the predominant HMM-based approach, objective evaluation shows that the spectrum generated from DBN has less distortion. Subjective results also confirm the advantage of the spectrum from DBN, and the overall quality is comparable to that of context-independent HMM.

机译：深度信仰网络（DBN）已被证明是诸如手写的数字图像生成的任务中的良好生成模型。在语音界中的DBN上的先前工作主要侧重于使用一般性预先训练的DBN来初始化语音识别中更好的声学建模的判别模型（SR）。为了充分利用其生成性质，我们建议将包括频谱和FO的语音参数进行模拟，并从DBN生成这些参数，用于语音合成。与主要的肝脏基础方法相比，客观评价表明，从DBN产生的光谱具有较小的失真。主观结果还确认了DBN的光谱的优势，并且整体质量与上下文的HMM相当。

著录项

来源
《IEEE International Conference on Acoustics, Speech, and Signal Processing》|2013年||共5页
会议地点
作者
Shiyin Kang; Xiaojun Qian; Helen Meng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词

相似文献

外文文献
中文文献
专利

1. Intonation classification for L2 English speech using multi-distribution deep neural networks [J] . Kun Li, Xixin Wu, Helen Meng Computer speech and language . 2017,第MAY期

机译：基于多分布深度神经网络的L2英语语音语调分类
2. Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis [J] . Ling, Z.-H., Deng, Audio, Speech, and Language Processing, IEEE Transactions on . 2013,第10期

机译：使用受限Boltzmann机和深度置信网络对频谱包络建模以进行统计参数语音合成
3. Acoustic Modeling Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis and Voice Conversion [J] . Zhen-Hua Ling, Ling-Hui Chen, Li-Rong Dai 電子情報通信学会技術研究報告. 音声. Speech . 2013,第366期

机译：使用受限Boltzmann机和Deep Belief网络进行声学建模以进行统计参数语音合成和语音转换
4. Multi-distribution deep belief network for speech synthesis [C] . Kang Shiyin, Qian Xiaojun, Meng Helen IEEE International Conference on Acoustics, Speech and Signal Processing . 2013

机译：用于语音合成的多分布深度置信网络
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Random Deep Belief Networks for Recognizing Emotions from Speech Signals [O] . Guihua Wen, Huihui Li, Jubing Huang, 2017

机译：随机深度信念网络用于从语音信号中识别情绪
7. MULTI-DISTRIBUTION DEEP BELIEF NETWORK FOR SPEECH SYNTHESIS [O] . Shiyin Kang, Xiaojun Qian, Helen Meng 2013

机译：语音合成的多分布深度信任网络

MULTI-DISTRIBUTION DEEP BELIEF NETWORK FOR SPEECH SYNTHESIS

摘要

著录项

相似文献

相关主题

期刊订阅