Journal: EURASIP Journal on Audio, Speech, and Music Processing

Ageing Voices: The Effect of Changes in Voice Parameters on ASR Performance



Abstract

With ageing, human voices undergo several changes, typically characterized by increased hoarseness and altered articulation patterns. In this study, we examined the effect of these changes on Automatic Speech Recognition (ASR) and found that the Word Error Rate (WER) on older voices is 10% absolute higher than that on adult voices. Subsequently, we compared several voice source parameters, including fundamental frequency, jitter, shimmer, harmonicity, and cepstral peak prominence, for adult and older males. Several of these parameters show statistically significant differences between the two groups. However, artificially increasing the jitter and shimmer measures does not affect ASR accuracy significantly. Artificially lowering the fundamental frequency degrades ASR performance marginally, but this drop can be overcome to some extent using Vocal Tract Length Normalisation (VTLN). Overall, we observe that changes in the voice source parameters do not have a significant impact on ASR performance. A comparison of the likelihood scores of all phonemes for the two age groups shows that there is a systematic mismatch in the acoustic space of the two groups. A comparison of the phoneme recognition rates shows that mid vowels, nasals, and phonemes that depend on the ability to create constrictions with the tongue tip are more affected by ageing than other phonemes.
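To make the "10% absolute" figure concrete, the following is a minimal sketch of how WER is conventionally computed (word-level edit distance divided by the number of reference words) and how an absolute gap differs from a relative one. The function names and the example WER values are hypothetical illustrations, not the authors' scoring tool or their reported numbers.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j]: edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def absolute_gap_points(wer_a: float, wer_b: float) -> float:
    """Absolute difference between two WERs, in percentage points."""
    return (wer_b - wer_a) * 100


def relative_gap_percent(wer_a: float, wer_b: float) -> float:
    """Relative increase of wer_b over wer_a, in percent of wer_a."""
    return (wer_b - wer_a) / wer_a * 100


if __name__ == "__main__":
    # One deleted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
    # Hypothetical WERs (not taken from the paper): 20% versus 30%.
    print(absolute_gap_points(0.20, 0.30))
    print(relative_gap_percent(0.20, 0.30))
```

With the hypothetical values above, the same pair of error rates gives a gap of 10 percentage points in absolute terms but a 50% increase in relative terms, which is why the abstract specifies "absolute".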
