...
首页> 外文期刊>Journal of medical systems >Classification of speech dysfluencies using LPC based parameterization techniques
【24h】

Classification of speech dysfluencies using LPC based parameterization techniques

机译:使用基于LPC的参数化技术对语音障碍进行分类

获取原文
获取原文并翻译 | 示例

摘要

The goal of this paper is to discuss and compare three feature extraction methods: Linear Predictive Coefficients (LPC), Linear Prediction Cepstral Coefficients (LPCC) and Weighted Linear Prediction Cepstral Coefficients (WLPCC) for recognizing the stuttered events. Speech samples from the University College London Archive of Stuttered Speech (UCLASS) were used for our analysis. The stuttered events were identified through manual segmentation and were used for feature extraction. Two simple classifiers namely, k-nearest neighbour (kNN) and Linear Discriminant Analysis (LDA) were employed for speech dysfluencies classification. Conventional validation method was used for testing the reliability of the classifier results. The study on the effect of different frame length, percentage of overlapping, value of ? in a first order pre-emphasizer and different order p were discussed. The speech dysfluencies classification accuracy was found to be improved by applying statistical normalization before feature extraction. The experimental investigation elucidated LPC, LPCC and WLPCC features can be used for identifying the stuttered events and WLPCC features slightly outperforms LPCC features and LPC features.
机译:本文的目的是讨论和比较三种特征提取方法:用于识别口吃事件的线性预测系数(LPC),线性预测倒谱系数(LPCC)和加权线性预测倒谱系数(WLPCC)。我们使用伦敦大学口吃演说档案馆(UCLASS)的语音样本进行分析。口吃事件通过手动分割进行识别,并用于特征提取。将两个简单的分类器,即k最近邻(kNN)和线性判别分析(LDA)用于语音流失程度分类。传统的验证方法用于测试分类器结果的可靠性。研究不同帧长,重叠百分比,α值的影响讨论了一阶预加重器和不同阶的p。发现通过在特征提取之前应用统计归一化可以提高语音不良性分类的准确性。实验研究表明,LPC,LPCC和WLPCC功能可用于识别口吃事件,WLPCC功能略胜于LPCC和LPC功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号