Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network

Mohammed Sidi Yakoub; Sid-ahmed Selouani; Brahim-Fares Zaidi; Asma Bouchair

首页> 外文期刊>EURASIP journal on audio, speech, and music processing >Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network

【24h】

Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network

机译：使用经验模式分解和卷积神经网络改善扰动语音识别

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

In this paper, we use empirical mode decomposition and Hurst-based mode selection (EMDH) along with deeplearning architecture using a convolutional neural network (CNN) to improve the recognition of dysarthric speech. TheEMDH speech enhancement technique is used as a preprocessing step to improve the quality of dysarthric speech.Then, the Mel-frequency cepstral coefficients are extracted from the speech processed by EMDH to be used as inputfeatures to a CNN-based recognizer. The effectiveness of the proposed EMDH-CNN approach is demonstrated by theresults obtained on the Nemours corpus of dysarthric speech. Compared to baseline systems that use Hidden Markovwith Gaussian Mixture Models (HMM-GMMs) and a CNN without an enhancement module, the EMDH-CNN systemincreases the overall accuracy by 20.72% and 9.95%, respectively, using a k-fold cross-validation experimental setup.

机译：在本文中，我们使用经验模式分解和基于赫斯特的模式选择（EMDH）以及使用卷积神经网络（CNN）的解剖架构，以改善扰动语音的识别。 TheEMDH语音增强技术用作预处理步骤，提高缺陷语音的质量。该改变emdh的语音中提取熔融频率谱系数，以用作基于CNN的识别器的输入方法。所提出的EMDH-CNN方法的有效性由在缺陷言论的Nemours语料上获得的结果证明。与使用隐马Markovwith高斯混合模型（HMM-GMMS）和CNN没有增强模块的CNN的基线系统相比，使用K折叠交叉验证实验，EMDH-CNN分别将整体精度和9.95％的整体精度释放到9.95％。设置。

著录项

来源
《EURASIP journal on audio, speech, and music processing》 |2020年第1期|共7页
作者
Mohammed Sidi Yakoub; Sid-ahmed Selouani; Brahim-Fares Zaidi; Asma Bouchair;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
DysarthriaEmpirical mode decompositionHurst mode selectionConvolutional neural network;

机译：DyStriaMirical模式分解模式选择旋转神经网络;

Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network

摘要

著录项

相关主题

期刊订阅