首页> 外文期刊>Medical and Biological Engineering and Computing: Journal of the International Federation for Medical and Biological Engineering >Acoustical recognition of laryngeal pathology: a comparison of two strategies based on sets of features.
【24h】

Acoustical recognition of laryngeal pathology: a comparison of two strategies based on sets of features.

机译:喉部病理学的声学识别:基于特征集的两种策略的比较。

获取原文
获取原文并翻译 | 示例
       

摘要

The efficiency of sets of acoustical features discriminating pathological voices from control voices is reported. Two strategies were compared. The first (called the 'distance strategy') was built upon a statistical distance of voice features to reference values obtained for a set of healthy (reference) voices. The second strategy (called the 'range strategy') is based on the position inside or outside normal ranges established from a reference population; results based on this strategy were presented in a previous paper. Reference values were calculated from a database of 200 healthy voices distributed into 10-year age groups ranging from 20 to 70. Comparisons were made using a second database of 220 voices, including 65 control, 51 functional dysphonia, 50 with nodules on the vocal folds and 54 recurrent nerve palsy. The phonetic material was compared of 17 French vowels: 11 vowels in a sentence, three isolated vowels and three segments (beginning, middle and end) of the sustained vowel /a/. Four acoustical features were considered for each vowel: the voice fundamental (f0) and the first three formant frequencies. Acoustical features were calculated on an ILS (Interactive Laboratory System) analysis system (workstation). The separation of each pathological group from the control group, using sets of acoustical features, was statistically assessed. From the strategy point of view, results indicated that (i) the fundamental frequency f0 was the best measure to separate normal from pathological voices with the distance strategy; (ii) when the formants were taken, the range strategy performed better in separating the voices. For classification of pathologies, the best separation coefficients were obtained with nodules and the worst with recurrent nerve palsy. Overall, it was seen that the separation between control and pathological voices was most efficient when measured using the distance strategy for f0. The range strategy was useful with formant frequencies.
机译:报告了将病理性语音与控制性语音区分开的一系列声学特征的效率。比较了两种策略。第一个(称为“距离策略”)建立在语音特征与从一组健康(参考)语音获得的参考值的统计距离上。第二种策略(称为“范围策略”)基于从参考人群建立的正常范围之内或之外的位置;在先前的论文中介绍了基于该策略的结果。参考值是从分布在20至70岁的10岁年龄组中的200种健康声音的数据库中计算出来的。使用第二种220种声音的数据库进行了比较,包括65个对照,51个功能障碍,50个声带结节。和54例复发性神经麻痹。比较了17个法语元音的语音材料:一个句子中的11个元音,三个独立的元音和持续元音/ a /的三个部分(开头,中间和结尾)。每个元音考虑了四个声学特征:语音基音(f0)和前三个共振峰频率。在ILS(交互式实验室系统)分析系统(工作站)上计算了声学特征。使用一组声学特征,统计评估每个病理组与对照组的分离。从策略的角度来看,结果表明:(i)基本频率f0是用距离策略将正常声音与病理声音分开的最佳方法; (ii)当采取共振峰时,距离策略在分离声音方面表现更好。对于病理学分类,结节获得最佳的分离系数,复发性神经麻痹获得最差的分离系数。总体而言,可以看出,使用f0的距离策略进行测量时,控制声音和病理声音之间的分离最有效。范围策略对于共振峰频率很有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号