首页> 外文会议> >MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA

【24h】

MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA

机译：基于MFCC的k-NN和LDA识别口吃语音中的重复和延长

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Stuttering is a speech disorder in which the normal flow of speech is disrupted by occurrences of dysfluencies, such as repetitions, interjection and so on. There are a high proportion of repetitions and prolongations in stuttered speech, usually at the beginning of sentences. Consequently, acoustic analysis can be used to classify the stuttered events. This paper describes particular stuttering events to be located as repetitions and prolongations in stuttered speech with feature extraction algorithm. The well known Mel Frequency Cepstral Coefficient (MFCC) feature extraction is implemented to test its effectiveness in recognizing prolongations and repetitions in a stuttered speech. In this work, two classifiers such as Linear Discriminant Analysis based classifier (LDA) and k-nearest neighbors (k-NN) are employed and k-fold cross-validation was applied to measure classifiers performances. The result of this work shows that the MFCC and classifiers (LDA and k-NN) can be used for recognition of repetitions and prolongations in stuttered speech with the average accuracy of 90%.

机译：口吃是一种言语障碍，其中正常的语音流因重复性，插入感等不当行为的出现而中断。口吃的重复和延长的比例很高，通常是在句子开头。因此，声学分析可用于对口吃事件进行分类。本文使用特征提取算法描述了特定的口吃事件，以定位和增强口吃语音中的重复性。实施众所周知的梅尔频率倒谱系数（MFCC）特征提取以测试其识别口吃语音中的延长和重复的有效性。在这项工作中，使用了两个分类器，例如基于线性判别分析的分类器（LDA）和k最近邻（k-NN），并且应用了k倍交叉验证来衡量分类器的性能。这项工作的结果表明，MFCC和分类器（LDA和k-NN）可用于识别口吃语音中的重复和延长，平均准确度为90％。

著录项

来源
《》|2009年|146-149|共4页
会议地点
作者
Lim Sin Chee; Ooi Chia Ai; Hariharan M.; Yaacob S.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
KNN; LDA; MFCC; Stuttering;

机译：KNN; LDA; MFCC;口吃;

相似文献

外文文献
中文文献
专利

1. Automatic Recognition of Prolongations and Repetitions in Stuttering Speech using ANN [J] . G. Manjula, M. Shiva Kumar, Y. V. Geetha Journal of the Instrument Society of India: Proceedings of the national symposium on instrumentation . 2016,第3期

机译：使用ANN自动识别口吃语音中的延伸和重复
2. Database development and automatic speech recognition of isolated Pashto spoken digits using MFCC and K-NN [J] . Zakir Ali, Arbab Waseem Abbas, T. M. Thasleema, International journal of speech technology . 2015,第2期

机译：使用MFCC和K-NN对孤立的普什图语语音进行数据库开发和自动语音识别
3. Identification and Validation of Repetitions/Prolongations in Stuttering Speech using Epoch Features [J] . G. Manjula M., Shiva Kumar, Y. V. Geetha, International Journal of Applied Engineering Research . 2017,第22aPta2期

机译：使用时期特征识别和验证口吃语音的重复/延长
4. MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA [C] . Lim Sin Chee, Ooi Chia Ai, Hariharan M., Research and Development (SCOReD), 2009 . 2009

机译：基于MFCC的k-NN和LDA识别口吃语音中的重复和延长
5. A speech recognition IC with an efficient MFCC extraction algorithm and multi-mixture models. [D] . Han, Wei. 2006

机译：具有高效MFCC提取算法和多混合模型的语音识别IC。
6. Development of a Two-Stage Procedure for the Automatic Recognition of Dysfluencies in the Speech of Children Who Stutter: II. ANN Recognition of Repetitions and Prolongations With Supplied Word Segment Markers [O] . Peter Howell, Stevie Sackin, Kazan Glenn -1

机译：自动识别口吃儿童言语中流离失所的两阶段程序的发展：II。具有提供的词段标记的ANN识别重复和延长
7. Deep Learning Bidirectional LSTM based Detection of Prolongation and Repetition in Stuttered Speech using Weighted MFCC [O] . Sakshi Gupta, Ravi S., Rajesh K., 2020

机译：基于深度学习的双向LSTM检测用加权MFCC的延长和重复检测

MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA

摘要

著录项

相似文献

相关主题

期刊订阅