首页> 外文会议>Annual Conference of the International Speech Communication Association >Predicting severity of voice disorder from DNN-HMM acoustic posteriors
【24h】

Predicting severity of voice disorder from DNN-HMM acoustic posteriors

机译:预测DNN-HMM声学后遗症的语音障碍严重程度

获取原文

摘要

Acoustical analysis of speech is considered a favorable and promising approach to objective assessment of voice disorders. Previous research emphasized on the extraction and classification of voice quality features from sustained vowel sounds. In this paper, an investigation on voice assessment using continuous speech utterances of Cantonese is presented. A DNN-HMM based speech recognition system is trained with speech data of unimpaired voice. The recognition accuracy for pathological utterances is found to decrease significantly with the disorder severity increasing. Average acoustic posterior probabilities are computed for individual phones from the speech recognition output lattices and the DNN soft-max layer. The phone posteriors obtained for continuous speech from the mild, moderate and severe categories are highly distinctive and thus useful to the determination of voice disorder severity. A subset of Cantonese phonemes are identified to be suitable and reliable for voice assessment with continuous speech.
机译:言论的声学分析被认为是对客观评估语音障碍的有利和有希望的方法。以前的研究强调了从持续的元音声音的语音质量特征的提取和分类。本文介绍了使用粤语连续语音话语的语音评估调查。基于DNN-HMM的语音识别系统训练,具有未受害声音的语音数据。发现病理话语的识别准确性随着疾病严重程度的增加而显着降低。从语音识别输出格子和DNN软最大层的单个手机计算平均声学后验概率。从轻度,中度和严重类别的连续演讲获得的手机后续是非常独特的,因此可用于测定语音障碍严重程度。粤语音素的子集被识别为具有连续语音的语音评估合适可靠。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号