首页> 外文期刊>EURASIP journal on audio, speech, and music processing >Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network
【24h】

Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network

机译:使用经验模式分解和卷积神经网络改善扰动语音识别

获取原文
           

摘要

In this paper, we use empirical mode decomposition and Hurst-based mode selection (EMDH) along with deeplearning architecture using a convolutional neural network (CNN) to improve the recognition of dysarthric speech. TheEMDH speech enhancement technique is used as a preprocessing step to improve the quality of dysarthric speech.Then, the Mel-frequency cepstral coefficients are extracted from the speech processed by EMDH to be used as inputfeatures to a CNN-based recognizer. The effectiveness of the proposed EMDH-CNN approach is demonstrated by theresults obtained on the Nemours corpus of dysarthric speech. Compared to baseline systems that use Hidden Markovwith Gaussian Mixture Models (HMM-GMMs) and a CNN without an enhancement module, the EMDH-CNN systemincreases the overall accuracy by 20.72% and 9.95%, respectively, using a k-fold cross-validation experimental setup.
机译:在本文中,我们使用经验模式分解和基于赫斯特的模式选择(EMDH)以及使用卷积神经网络(CNN)的解剖架构,以改善扰动语音的识别。 TheEMDH语音增强技术用作预处理步骤,提高缺陷语音的质量。该改变emdh的语音中提取熔融频率谱系数,以用作基于CNN的识别器的输入方法。所提出的EMDH-CNN方法的有效性由在缺陷言论的Nemours语料上获得的结果证明。与使用隐马Markovwith高斯混合模型(HMM-GMMS)和CNN没有增强模块的CNN的基线系统相比,使用K折叠交叉验证实验,EMDH-CNN分别将整体精度和9.95%的整体精度释放到9.95%。设置。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号