首页> 外文OA文献 >The development of voice disorder evaluation system based on dysphonia severity index
【2h】

The development of voice disorder evaluation system based on dysphonia severity index

机译:基于语音障碍严重程度指标的语音障碍评估系统的开发

摘要

Voice disorder is dramatically increasing due to the unhealthy social habits such as smoking and alcohol consumption, voice abuse, and the most importantly the lack of awareness among the general public and from the health care provider. Objective non-invasive multiparameter voice assessment is seen as a way to improve the voice rehabilitation process by allowing home care at own responsibility. The purpose of the research is to develop an automatic voice diagnostic system based on objective non-invasive multiparameter method known as Dysphonia Severity Index (DSI). DSI consists of four parameters which are the highest pitch, jitter percentage, lowest intensity, and maximum phonation time. They are combined into a linear regression equation that will give values from -5 to +5 indicating severely dysphonic voice or normal voice respectively. The proposed system is named as Automatic Dysphonia Evaluation System (ADES). It integrates a new proposed pitch detection algorithm (PDA), start/end point detection algorithm, jitter equation, and intensity equation to obtain the four DSI parameters allowing the system to be used by patient at home to monitor their voices. The proposed PDA was proven more accurate by having no error detected for normal voice while only one pathological voice was detected with doubling error. The modified start/end point detection algorithm is proven better with silence detection error rate of 0.0752. ADES was tested with KayPENTAX voice database and had 55.6054% sensitivity and 50% specificity when -6.7249 is used as the cutoff value. Different sets of database consisted of trained and untrained vocalists, and also teachers and non-teachers were also used to evaluate ADES’ performance. The results of ADES show that it is able to get the DSI values for different voices from different types of groups.
机译:由于不健康的社会习惯,例如吸烟和饮酒,滥用声音,以及最重要的是,公众和医疗保健提供者缺乏认识,导致语音障碍急剧增加。客观的非侵入性多参数语音评估被视为通过允许家庭护理自行承担责任来改善语音康复过程的一种方式。该研究的目的是开发一种基于客观非侵入性多参数方法的自动语音诊断系统,该方法被称为语音困难程度指数(DSI)。 DSI由四个参数组成,它们是最高音调,抖动百分比,最低强度和最大发声时间。它们被组合成线性回归方程,该方程将给出从-5到+5的值,分别表示严重的重音语音或正常语音。拟议中的系统被称为自动语音困难评估系统(ADES)。它集成了新提出的音高检测算法(PDA),起点/终点检测算法,抖动方程式和强度方程式,以获得四个DSI参数,允许患者在家中使用该系统监视他们的声音。事实证明,所提议的PDA更准确,因为没有检测到正常语音的错误,而仅检测到一种病理性语音,其错误加倍。改进的起点/终点检测算法被证明具有0.0752的静音检测错误率更好。当使用-6.7249作为截止值时,ADES已通过KayPENTAX语音数据库进行了测试,灵敏度为55.6054%,特异性为50%。不同的数据库集由训练有素和未经训练的歌手组成,并且还使用教师和非教师来评估ADES的表现。 ADES的结果表明,它能够从不同类型的组中获得不同语音的DSI值。

著录项

  • 作者

    Jamaludin Mohd. Redzuan;

  • 作者单位
  • 年度 2012
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号