Conference: International Conference on Speech and Computer

Combining Atom Decomposition of the F0 Track and HMM-based Phonological Phrase Modelling for Robust Stress Detection in Speech

Abstract

The Weighted Correlation based Atom Decomposition (WCAD) algorithm is an intonation modelling technique that uses a matching pursuit framework to decompose the F0 contour into a set of basic components called atoms. The atoms attempt to model the physiological activation of the laryngeal muscles responsible for changes in F0. Recently, WCAD has been upgraded to use the orthogonal matching pursuit (OMP) algorithm, which gives qualitative improvements in the modelling of intonation. One possible application of the OMP-based WCAD is the automatic detection of stress in speech, which we undertake for Hungarian. We demonstrate a correlation between stress and atomic peaks, as well as between stress and atomic valleys on the previous syllable. The WCAD-based stress detection technique is compared to a baseline system using HMM/GMM stress/phrase models. When evaluated against a hand-made reference, it improves the F-measure by 7% over the baseline. Finally, we propose a hybrid approach that outperforms both individual systems (by 11% compared to the baseline).
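To make the idea of decomposing a contour into atoms more concrete, the following is a minimal sketch of plain (non-orthogonal) matching pursuit on a toy signal. It is not the WCAD algorithm: the Gaussian-shaped atoms, the dictionary layout, the toy contour, and all function names are assumptions made for illustration, and neither the weighted correlation criterion nor the re-fitting step of OMP is shown. In this toy setting a positive amplitude can be read loosely as a peak and a negative one as a valley.

```python
import math

def gaussian_atom(length, center, width):
    """Unit-norm bump; a hypothetical stand-in for the real atom shape."""
    raw = [math.exp(-0.5 * ((t - center) / width) ** 2) for t in range(length)]
    norm = math.sqrt(sum(v * v for v in raw))
    return [v / norm for v in raw]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def matching_pursuit(signal, dictionary, n_atoms):
    """Greedy decomposition; returns [(atom_index, amplitude), ...] and the residual."""
    residual = list(signal)
    chosen = []
    for _ in range(n_atoms):
        # Correlate every (unit-norm) atom with the current residual.
        scores = [dot(residual, atom) for atom in dictionary]
        # Keep the atom with the largest absolute correlation.
        best = max(range(len(dictionary)), key=lambda i: abs(scores[i]))
        amplitude = scores[best]
        chosen.append((best, amplitude))
        # Subtract the selected atom's contribution from the residual.
        residual = [r - amplitude * a for r, a in zip(residual, dictionary[best])]
    return chosen, residual

# Toy "contour": one positive bump (peak) and one negative bump (valley).
length = 100
dictionary = [gaussian_atom(length, c, 5.0) for c in range(0, length, 5)]
contour = [3.0 * a - 1.5 * b for a, b in zip(dictionary[4], dictionary[12])]

atoms, residual = matching_pursuit(contour, dictionary, n_atoms=2)
for index, amplitude in atoms:
    kind = "peak" if amplitude > 0 else "valley"
    print(f"atom {index}: amplitude {amplitude:+.3f} ({kind})")
```

Plain matching pursuit keeps each amplitude fixed once its atom is selected. OMP instead re-estimates all selected amplitudes jointly after every selection, which matters when atoms overlap more than they do in this toy example.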
