Predicting severity of voice disorder from DNN-HMM acoustic posteriors

机译：预测DNN-HMM声学后遗症的语音障碍严重程度

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Acoustical analysis of speech is considered a favorable and promising approach to objective assessment of voice disorders. Previous research emphasized on the extraction and classification of voice quality features from sustained vowel sounds. In this paper, an investigation on voice assessment using continuous speech utterances of Cantonese is presented. A DNN-HMM based speech recognition system is trained with speech data of unimpaired voice. The recognition accuracy for pathological utterances is found to decrease significantly with the disorder severity increasing. Average acoustic posterior probabilities are computed for individual phones from the speech recognition output lattices and the DNN soft-max layer. The phone posteriors obtained for continuous speech from the mild, moderate and severe categories are highly distinctive and thus useful to the determination of voice disorder severity. A subset of Cantonese phonemes are identified to be suitable and reliable for voice assessment with continuous speech.

机译：言论的声学分析被认为是对客观评估语音障碍的有利和有希望的方法。以前的研究强调了从持续的元音声音的语音质量特征的提取和分类。本文介绍了使用粤语连续语音话语的语音评估调查。基于DNN-HMM的语音识别系统训练，具有未受害声音的语音数据。发现病理话语的识别准确性随着疾病严重程度的增加而显着降低。从语音识别输出格子和DNN软最大层的单个手机计算平均声学后验概率。从轻度，中度和严重类别的连续演讲获得的手机后续是非常独特的，因此可用于测定语音障碍严重程度。粤语音素的子集被识别为具有连续语音的语音评估合适可靠。

著录项

来源
《Annual Conference of the International Speech Communication Association》|2016年|744p|共5页
会议地点
作者
Tan Lee; Yuanyuan Liu; Yu Ting Yeung; Thomas K.T. Law; Kathy Y.S. Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TB95-53;
关键词

相似文献

外文文献
中文文献
专利

1. Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features [J] . Liu Yuanyuan, Lee Tan, Law Thomas, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第6期

机译：使用ASR后验特征对连续语音的语音障碍进行声学评估
2. Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features [J] . Liu Yuanyuan, Lee Tan, Law Thomas, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第6期

机译：使用ASR后部特征的语音障碍的声学评估
3. Severity of voice disorders: integration of perceptual and acoustic data in dysphonic patients [J] . Débora Pontes Cavalcante, Leonardo Wanderley Lopes, Priscila Oliveira Da Costa CoDAS . 2014,第5期

机译：声音障碍的严重程度：语音障碍患者的感知和听觉数据整合
4. Predicting severity of voice disorder from DNN-HMM acoustic posteriors [C] . Tan Lee, Yuanyuan Liu, Yu Ting Yeung, Annual Conference of the International Speech Communication Association . 2016

机译：预测DNN-HMM声学后遗症的语音障碍严重程度
5. A Posterior Predictive Model Checking Method Assuming Posterior Normality for Item Response Theory. [D] . Kuhfeld, Megan Rebecca. 2016

机译：假设项目反应理论为后验正态性的后验预测模型检验方法。
6. Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology [O] . James C. Mundt, Peter J. Snyder, Michael S. Cannizzaro, -1

机译：通过交互式语音应答（IVR）技术收集的抑郁症严重程度和治疗应答的语音声学测量
7. Severity of voice disorders: integration of perceptual and acoustic data in dysphonic patients [O] . Leonardo Wanderley Lopes, Débora Pontes Cavalcante, Priscila Oliveira da Costa 2014

机译：语音障碍的严重程度：困扰患者中感知和声学数据的集成

Predicting severity of voice disorder from DNN-HMM acoustic posteriors

摘要

著录项

相似文献

相关主题

期刊订阅