Journal: Computer Speech and Language

Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features



Abstract

In clinical practice, speech-language pathologists assess the intelligibility of speakers with dysarthria through auditory perceptual tests, which require the patient's presence at the hospital and involve time-consuming examinations. Frequent clinical monitoring can be costly and logistically inconvenient for both patients and medical experts. Here, we aim to automate intelligibility assessment in dysarthric speakers with an objective, speech-based method that can be employed in a telescreening application. The proposed method predicts the intelligibility level of dysarthric speakers on a four-level scale (very low, low, mediocre and high). The study compares several automatic methods for assessing the intelligibility level of speakers with dysarthria, utilizing information generated at the level of the vocal folds through glottal features and using coded telephone speech (i.e. speech of the kind used in telescreening applications). In addition to the glottal features, the openSMILE features are used as baseline acoustic features. Multiclass support vector machine (SVM) classifiers are trained using features obtained from coded speech utterances and the corresponding intelligibility level labels. Separate sets of multiclass SVMs are trained using the individual glottal and acoustic features as well as their combinations. Coded telephone speech is generated with the adaptive multi-rate codec in two operational bandwidths (narrowband and wideband) from utterances of an open database of dysarthric speech (Universal Access-Speech). Experimental results showed good classification accuracies for the glottal features, indicating their effectiveness in assessing the intelligibility level of speakers with dysarthria even in the challenging coded condition. Classification accuracy improved when the glottal features were combined with the openSMILE acoustic features, which validates the complementary nature of the glottal features.
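The classification setup described in the abstract (one multiclass SVM per feature set, plus one on the combined features) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the features are synthetic Gaussian clusters standing in for real glottal and openSMILE vectors, the kernel is assumed linear, the multiclass scheme is assumed one-vs-rest, the combination is assumed to be simple concatenation, and the SVM is fitted by plain subgradient descent on the hinge loss rather than by a library solver. No glottal feature extraction or AMR coding is performed, and all function names and dimensions are hypothetical.

```python
import random

# The four intelligibility levels named in the abstract.
LEVELS = ["very low", "low", "mediocre", "high"]


def synthetic_features(labels, dim, sep, rng):
    """Hypothetical stand-in for real features: one Gaussian cluster per level."""
    return [[rng.gauss(sep if d == y else 0.0, 1.0) for d in range(dim)]
            for y in labels]


def train_binary_svm(X, targets, epochs=30, lr=0.01, lam=0.01, seed=0):
    """Linear SVM fitted by subgradient descent on the L2-regularised hinge loss."""
    rng = random.Random(seed)
    w, b = [0.0] * len(X[0]), 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            margin = targets[i] * (sum(wj * xj for wj, xj in zip(w, X[i])) + b)
            w = [wj * (1.0 - lr * lam) for wj in w]  # shrinkage from the L2 penalty
            if margin < 1.0:  # hinge term is active for this sample
                w = [wj + lr * targets[i] * xj for wj, xj in zip(w, X[i])]
                b += lr * targets[i]
    return w, b


def train_multiclass_svm(X, y):
    """One-vs-rest (an assumption): one binary SVM per intelligibility level."""
    return [train_binary_svm(X, [1.0 if label == k else -1.0 for label in y])
            for k in range(len(LEVELS))]


def predict(models, x):
    """Return the index of the level whose classifier gives the highest score."""
    scores = [sum(wj * xj for wj, xj in zip(w, x)) + b for w, b in models]
    return max(range(len(scores)), key=scores.__getitem__)


def accuracy(models, X, y):
    return sum(predict(models, x) == label for x, label in zip(X, y)) / len(y)


def run_experiment(seed=0):
    rng = random.Random(seed)
    y_train = [k for k in range(len(LEVELS)) for _ in range(50)]
    y_test = [k for k in range(len(LEVELS)) for _ in range(25)]
    feats = {}
    # Dimensions 4 and 6 are arbitrary placeholders, not the real feature sizes.
    for name, dim in (("glottal", 4), ("acoustic", 6)):
        feats[name] = (synthetic_features(y_train, dim, 4.0, rng),
                       synthetic_features(y_test, dim, 4.0, rng))
    # Feature-level combination: concatenate the two vectors of each utterance.
    feats["combined"] = tuple(
        [g + a for g, a in zip(feats["glottal"][s], feats["acoustic"][s])]
        for s in (0, 1))
    return {name: accuracy(train_multiclass_svm(X_tr, y_train), X_te, y_test)
            for name, (X_tr, X_te) in feats.items()}


if __name__ == "__main__":
    for name, acc in run_experiment().items():
        print(f"{name:>8}: {acc:.2f}")
```

The accuracies this prints reflect only the synthetic clusters and say nothing about the results reported in the paper. In a real experiment the train and test utterances would come from the coded Universal Access-Speech recordings, and the split and kernel would follow the paper's protocol.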
