Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis

Zhen-Hua Ling; Richmond K.; Yamagishi J.; Ren-Hua Wang

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis

【24h】

Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis

机译：将发音特征集成到基于HMM的参数语音合成中

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents an investigation into ways of integrating articulatory features into hidden Markov model (HMM)-based parametric speech synthesis. In broad terms, this may be achieved by estimating the joint distribution of acoustic and articulatory features during training. This may in turn be used in conjunction with a maximum-likelihood criterion to produce acoustic synthesis parameters for generating speech. Within this broad approach, we explore several variations that are possible in the construction of an HMM-based synthesis system which allow articulatory features to influence acoustic modeling: model clustering, state synchrony and cross-stream feature dependency. Performance is evaluated using the RMS error of generated acoustic parameters as well as formal listening tests. Our results show that the accuracy of acoustic parameter prediction and the naturalness of synthesized speech can be improved when shared clustering and asynchronous-state model structures are adopted for combined acoustic and articulatory features. Most significantly, however, our experiments demonstrate that modeling the dependency between these two feature streams can make speech synthesis systems more flexible. The characteristics of synthetic speech can be easily controlled by modifying generated articulatory features as part of the process of producing acoustic synthesis parameters.

机译：本文对将发音特征集成到基于隐马尔可夫模型（HMM）的参量语音合成中的方法进行了研究。概括地说，这可以通过估计训练过程中声音和关节特征的联合分布来实现。这又可以与最大似然标准结合使用，以产生用于产生语音的声学合成参数。在这种广泛的方法中，我们探索了基于HMM的合成系统构建中可能出现的几种变体，这些变体允许发音特征影响声学建模：模型聚类，状态同步和跨流特征相关性。使用生成的声学参数的RMS误差以及正式的听觉测试来评估性能。我们的结果表明，将共享的聚类和异步状态模型结构用于组合的声学和发音特征时，可以提高声学参数预测的准确性和合成语音的自然性。然而，最重要的是，我们的实验表明，对这两个特征流之间的依赖性进行建模可以使语音合成系统更加灵活。在生成声音合成参数的过程中，可以通过修改生成的发音特征来轻松控制合成语音的特征。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2009年第6期|p.1171-1185|共15页
作者
Zhen-Hua Ling; Richmond K.; Yamagishi J.; Ren-Hua Wang;
展开▼
作者单位

iFlytek Speech Lab., Univ. of Sci. & Technol. of China, Hefei;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
acoustic signal processing; feature extraction; hidden Markov models; maximum likelihood estimation; speech synthesis; HMM-based parametric speech synthesis; acoustic parameter prediction; acoustic synthesis parameter; articulatory feature; asynchronous-state model structure; cross-stream feature dependency; hidden Markov model; maximum-likelihood criterion; shared clustering system; Articulatory features; hidden Markov model (HMM); speech production;

机译：声信号处理;特征提取;隐马尔可夫模型;最大似然估计;语音合成;基于HMM的参量语音合成;声学参数预测;声学综合参数;发音特征;异步状态模型结构;跨流特征相关性;隐马尔可夫模型;最大似然准则;共享聚类系统;发音特征;隐马尔可夫模型（HMM）;语音产生;

相似文献

外文文献
中文文献
专利

1. An HMM-based speech recognizer using overlapping articulatory features [J] . Kevin Erler, George H. Freeman The Journal of the Acoustical Society of America . 1996,第4期

机译：基于HMM的语音识别器，使用重叠的发音特征
2. Integration of Spectral Feature Extraction and Modeling for HMM-Based Speech Synthesis [J] . Kazuhiro NAKAMURA, Kei HASHIMOTO, Yoshihiko NANKAKU, IEICE transactions on information and systems . 2014,第6期

机译：基于HMM的语音合成的频谱特征提取与建模集成
3. Estimation of articulatory movements from speech acoustics using an HMM-based speech production model [J] . Hiroya S., Honda M. IEEE Transactions on Speech and Audio Proceessing . 2004,第2期

机译：使用基于HMM的语音产生模型估计语音声学中的发音运动
4. Feature-Space Transform Tying in Unified Acoustic-Articulatory Modelling for Articulatory Control of HMM-based Speech Synthesis [C] . Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi Annual conference of the International Speech Communication Association;INTERSPEECH 2011 . 2011

机译：基于HMM语音合成的发音控制的统一声学发音模型中的特征空间变换绑定
5. Articulatory speech synthesis and speech production modelling. [D] . Huang, Jun. 2001

机译：发音语音合成和语音产生建模。
6. Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion [O] . Prasanta Kumar Ghosh, Shrikanth Narayanan -1

机译：使用从独立于受试者的声学到发音反转的发音特征进行自动语音识别
7. Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis [O] . Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi, 2009

机译：将发音特征整合到基于Hmm的参数语音合成中
8. Speech Recognition, Articulatory Feature Detection, and Speech Synthesis in Multiple Languages [R] . Ore, B. M. 2009

机译：语音识别，发音特征检测和多语言语音合成

Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis

摘要

著录项

相似文献

相关主题

期刊订阅