首页> 外文会议>Annual International Conference of the IEEE Engineering in Medicine and Biology Society >Screening and analysis of specific language impairment in young children by analyzing the textures of speech signal
【24h】

Screening and analysis of specific language impairment in young children by analyzing the textures of speech signal

机译:通过分析语音信号的纹理来筛选和分析幼儿特定语言障碍

获取原文

摘要

A child having a delayed development in language skills without any reason is known to be suffering from specific language impairment (SLI). Unfortunately, almost 7% kindergarten children are reported with SLI in their childhood. The SLI could be treated if identified at an early stage, but diagnosing SLI at early stage is challenging. In this article, we propose a machine learning based system to screen the SLI speech by analyzing the texture of the speech utterances. The texture of speech signals is extracted from the popular time-frequency representation called spectrograms. These spectrogram acts like a texture image and the textural features to capture the change in audio quality such as Haralick’s feature and local binary patterns (LBPs) are extracted from these textural images. The experiments are performed on 4214 utterances taken from 44 healthy and 54 SLI speakers. Experimental results with 10-fold cross validation, indicates that a very good accuracy up to 97.41% is obtained when only 14 dimensional Haralick’s feature is used. The accuracy is slightly boosted up to 99% when the 59-dimensional LBPs are amalgamated with Haralick’s features. The sensitivity and specificity of the whole system is up to 98.96% and 99.20% respectively. The proposed method is gender and speaker independent and invariant to examination conditions.
机译:已知语言技能发育迟缓而没有任何原因的孩子患有特定的语言障碍(SLI)。不幸的是,据报道有将近7%的幼儿园儿童在童年时期患有SLI。如果在早期发现SLI,就可以进行治疗,但是在早期诊断SLI是具有挑战性的。在本文中,我们提出了一种基于机器学习的系统,通过分析语音发声的纹理来筛选SLI语音。语音信号的纹理是从流行的称为频谱图的时频表示中提取的。这些频谱图的作用类似于纹理图像,而纹理特征可捕获音频质量的变化,例如Haralick的特征,并从这些纹理图像中提取局部二进制模式(LBP)。实验是从44位健康的说话者和54位SLI说话者的4214条发声中进行的。进行10倍交叉验证的实验结果表明,仅使用14维Haralick的特征时,可以获得高达97.41%的非常好的准确性。当59维LBP与Haralick的功能融合在一起时,其精度会略微提高到99%。整个系统的灵敏度和特异性分别达到98.96%和99.20%。所提出的方法是性别和说话者无关的,并且对于检查条件是不变的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号