首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >HMM-based stressed speech modeling with application to improvedsynthesis and recognition of isolated speech under stress
【24h】

HMM-based stressed speech modeling with application to improvedsynthesis and recognition of isolated speech under stress

机译:基于HMM的压力语音建模及其在压力下孤立语音的合成和识别中的应用

获取原文
获取原文并翻译 | 示例

摘要

A novel approach is proposed for modeling speech parameter variations between neutral and stressed conditions and employed in a technique for stressed speech synthesis and recognition. The proposed method consists of modeling the variations in pitch contour, voiced speech duration, and average spectral structure using hidden Markov models (HMMs). While HMMs have traditionally been used for recognition applications, here they are employed to statistically model the characteristics needed for generating pitch contour and spectral perturbation contour patterns to modify the speaking style of isolated neutral words. The proposed HMM models are both speaker and word-independent, but unique to each speaking style. While the modeling scheme is applicable to a variety of stress and emotional speaking styles, the evaluations presented focus on angry speech, the Lombard (1911) effect, and loud spoken speech in three areas. First, formal subjective listener evaluations of the modified speech confirm the HMMs ability to capture the parameter variations under stressed conditions. Second, an objective evaluation using a separately formulated stress classifier is employed to assess the presence of stress imparted on the synthetic speech. Finally, the stressed speech is also used for training and shown to measurably improve the performance of an HMM-based stressed speech recognizer
机译:提出了一种新颖的方法来对中性和压力条件之间的语音参数变化建模,并将其用于压力语音合成和识别技术中。所提出的方法包括使用隐马尔可夫模型(HMM)对音高轮廓,浊音持续时间和平均频谱结构的变化进行建模。尽管传统上将HMM用于识别应用程序,但在这里将它们用于统计建模音高轮廓和频谱扰动轮廓图案所需的特征,以修改孤立的中性词的说话风格。提议的HMM模型既与说话者无关,又与单词无关,但是对于每种说话风格都是唯一的。虽然建模方案适用于多种压力和情感说话风格,但评估主要集中在愤怒的言语,伦巴第(1911)效应和大声讲话三个方面。首先,正式的主观听众对修改语音的评估确认了HMM在压力条件下捕获参数变化的能力。其次,使用单独制定的压力分类器进行客观评估,以评估施加在合成语音上的压力的存在。最后,强调语音也用于训练,并显示出可测量地提高基于HMM的强调语音识别器的性能

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号