Annual Conference of the International Speech Communication Association (INTERSPEECH 2011)

Speech Synthesis based on Articulatory-Movement HMMs with Voice-source Codebooks

Abstract

Speech synthesis based on a single model of articulatory-movement HMMs, applied in common to both speech recognition (SR) and speech synthesis (SS), is described. In the SS module, speaker-invariant HMMs generate an articulatory feature (AF) sequence; the AFs are then converted into vocal-tract parameters by a multilayer neural network (MLN), and a speech signal is synthesized through an LSP digital filter. The CELP coding technique is applied to improve the voice sources, which are generated from codes embedded in the corresponding HMM states. The proposed SS module separates phonetic information from speaker individuality, so the target speaker's voice can be synthesized from a small amount of speech data. In the experiments, we carried out listening tests with ten subjects and evaluated both the sound quality and the speaker individuality of the synthesized speech. As a result, we confirmed that the proposed SS module could produce good-quality speech for the target speaker even when training used a data set of only two sentences.
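The AF-to-vocal-tract conversion step can be sketched as a small feed-forward network. This is an illustrative reconstruction only, not the paper's MLN: the layer sizes (18 AFs in, 32 hidden units, 10 LSPs out), the tanh hidden layer, and the cumulative-gap output parameterization are assumptions, and the random weights stand in for trained ones.

```python
import numpy as np

class AFtoLSPNet:
    """Feed-forward network mapping one articulatory-feature (AF) vector
    to line-spectral-pair (LSP) vocal-tract parameters.

    A sketch: layer sizes and the output parameterization are assumptions,
    and the weights below are random stand-ins for trained ones.
    """

    def __init__(self, n_af=18, n_hidden=32, n_lsp=10, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.1, (n_hidden, n_af))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0.0, 0.1, (n_lsp, n_hidden))
        self.b2 = np.zeros(n_lsp)

    def forward(self, af):
        h = np.tanh(self.W1 @ af + self.b1)
        raw = self.W2 @ h + self.b2
        # Emit positive "gaps" and accumulate them, so the predicted LSP
        # frequencies are strictly increasing and stay inside (0, pi) --
        # the ordering required for a stable LSP synthesis filter.
        gaps = np.exp(raw)
        return np.pi * np.cumsum(gaps) / (np.sum(gaps) + 1.0)
```

The cumulative-gap output is one common way to make every network output a valid, ordered LSP vector by construction, rather than a detail taken from the paper.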
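The LSP filter and the codebook-driven voice source can be sketched in the same way. The LSP-to-LPC reconstruction below is the standard one (conjugate zero pairs on the unit circle plus fixed zeros at z = ±1 for an even-order filter); the two-entry "codebook" of a pulse train and noise is a toy stand-in for the paper's CELP-derived sources, whose actual contents are not given in the abstract.

```python
import numpy as np

def lsp_to_lpc(lsp):
    """Rebuild LPC coefficients A(z) from sorted LSP frequencies (even order p).

    Even-indexed frequencies form P(z) together with a fixed zero at z = -1,
    odd-indexed ones form Q(z) with a fixed zero at z = +1; A(z) = (P + Q) / 2.
    Interleaved frequencies in (0, pi) guarantee a stable filter 1/A(z).
    """
    lsp = np.asarray(lsp, dtype=float)
    P, Q = np.array([1.0]), np.array([1.0])
    for w in lsp[0::2]:
        P = np.convolve(P, [1.0, -2.0 * np.cos(w), 1.0])
    for w in lsp[1::2]:
        Q = np.convolve(Q, [1.0, -2.0 * np.cos(w), 1.0])
    P = np.convolve(P, [1.0, 1.0])
    Q = np.convolve(Q, [1.0, -1.0])
    a = 0.5 * (P + Q)
    return a[:-1]  # the highest-order terms of P and Q cancel exactly

def all_pole_filter(a, x):
    """Direct-form IIR filtering of excitation x through 1/A(z)."""
    y = np.zeros(len(x))
    for n in range(len(x)):
        acc = x[n]
        for k in range(1, min(len(a), n + 1)):
            acc -= a[k] * y[n - k]
        y[n] = acc
    return y

def codebook_entry(index, n=160, period=80, seed=0):
    """Toy voice-source codebook: entry 0 = pitch pulse train, 1 = noise."""
    if index == 0:
        e = np.zeros(n)
        e[::period] = 1.0
        return e
    return np.random.default_rng(seed).normal(0.0, 0.1, n)

def synthesize_frame(lsp, excitation):
    """One frame of the SS back end: excitation -> 1/A(z) -> speech samples."""
    return all_pole_filter(lsp_to_lpc(lsp), excitation)
```

A real implementation would also apply a frame gain and interpolate LSPs between frames; both are omitted from this sketch.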
