首页> 外文会议> >MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA
【24h】

MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA

机译:基于MFCC的k-NN和LDA识别口吃语音中的重复和延长

获取原文

摘要

Stuttering is a speech disorder in which the normal flow of speech is disrupted by occurrences of dysfluencies, such as repetitions, interjection and so on. There are a high proportion of repetitions and prolongations in stuttered speech, usually at the beginning of sentences. Consequently, acoustic analysis can be used to classify the stuttered events. This paper describes particular stuttering events to be located as repetitions and prolongations in stuttered speech with feature extraction algorithm. The well known Mel Frequency Cepstral Coefficient (MFCC) feature extraction is implemented to test its effectiveness in recognizing prolongations and repetitions in a stuttered speech. In this work, two classifiers such as Linear Discriminant Analysis based classifier (LDA) and k-nearest neighbors (k-NN) are employed and k-fold cross-validation was applied to measure classifiers performances. The result of this work shows that the MFCC and classifiers (LDA and k-NN) can be used for recognition of repetitions and prolongations in stuttered speech with the average accuracy of 90%.
机译:口吃是一种言语障碍,其中正常的语音流因重复性,插入感等不当行为的出现而中断。口吃的重复和延长的比例很高,通常是在句子开头。因此,声学分析可用于对口吃事件进行分类。本文使用特征提取算法描述了特定的口吃事件,以定位和增强口吃语音中的重复性。实施众所周知的梅尔频率倒谱系数(MFCC)特征提取以测试其识别口吃语音中的延长和重复的有效性。在这项工作中,使用了两个分类器,例如基于线性判别分析的分类器(LDA)和k最近邻(k-NN),并且应用了k倍交叉验证来衡量分类器的性能。这项工作的结果表明,MFCC和分类器(LDA和k-NN)可用于识别口吃语音中的重复和延长,平均准确度为90%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号