...
首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis
【24h】

Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis

机译:将发音特征集成到基于HMM的参数语音合成中

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents an investigation into ways of integrating articulatory features into hidden Markov model (HMM)-based parametric speech synthesis. In broad terms, this may be achieved by estimating the joint distribution of acoustic and articulatory features during training. This may in turn be used in conjunction with a maximum-likelihood criterion to produce acoustic synthesis parameters for generating speech. Within this broad approach, we explore several variations that are possible in the construction of an HMM-based synthesis system which allow articulatory features to influence acoustic modeling: model clustering, state synchrony and cross-stream feature dependency. Performance is evaluated using the RMS error of generated acoustic parameters as well as formal listening tests. Our results show that the accuracy of acoustic parameter prediction and the naturalness of synthesized speech can be improved when shared clustering and asynchronous-state model structures are adopted for combined acoustic and articulatory features. Most significantly, however, our experiments demonstrate that modeling the dependency between these two feature streams can make speech synthesis systems more flexible. The characteristics of synthetic speech can be easily controlled by modifying generated articulatory features as part of the process of producing acoustic synthesis parameters.
机译:本文对将发音特征集成到基于隐马尔可夫模型(HMM)的参量语音合成中的方法进行了研究。概括地说,这可以通过估计训练过程中声音和关节特征的联合分布来实现。这又可以与最大似然标准结合使用,以产生用于产生语音的声学合成参数。在这种广泛的方法中,我们探索了基于HMM的合成系统构建中可能出现的几种变体,这些变体允许发音特征影响声学建模:模型聚类,状态同步和跨流特征相关性。使用生成的声学参数的RMS误差以及正式的听觉测试来评估性能。我们的结果表明,将共享的聚类和异步状态模型结构用于组合的声学和发音特征时,可以提高声学参数预测的准确性和合成语音的自然性。然而,最重要的是,我们的实验表明,对这两个特征流之间的依赖性进行建模可以使语音合成系统更加灵活。在生成声音合成参数的过程中,可以通过修改生成的发音特征来轻松控制合成语音的特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号