首页> 外文会议>International Conference on Inventive Research in Computing Applications >Development of Robust Automatic Speech Recognition System for Children's using Kaldi Toolkit
【24h】

Development of Robust Automatic Speech Recognition System for Children's using Kaldi Toolkit

机译:使用Kaldi Toolkit开发用于儿童的鲁棒自动语音识别系统

获取原文

摘要

In this paper, the Punjabi children speech recognition system is developed using Subspace Gaussian mixture models (SGMM) acoustic modeling techniques. Initially, the system is dependent upon Mel-frequency cepstral coefficients (MFCC) approach for controlling the temporal variations in the input speech signals. Here, SGMM is integrated with HMM to measure the efficiency of each state which carries the information of a short-windowed frame. For handling the children speaker acoustic variations speaker adaptive training (SAT), based on vocal-tract length normalization and feature space maximum likelihood linear regression is adopted. Kaldi and open-source speech recognition toolkit is used to develop the Robust Automatic Speech Recognition (ASR) System for Punjabi Children's speech. S GMM accumulate the frame coefficients and their posterior probabilities and pass these probabilities to HMM which systematically fit the frame and output have resulted from HMM states. Therefore, the achievement of SGMM has gotten a large performance margin in Punjabi children speech recognition. A remarkable depletion in the word error rate (WER) was noticed using SGMM by varying the feature dimensions. The developed children ASR system obtained a recognition accuracy of 83.66% while tested by varying the feature dimensions to 12.
机译:在本文中,使用子空间高斯混合模型(SGMM)声学建模技术开发了旁遮普儿童语音识别系统。最初,该系统依赖于梅尔频率倒谱系数(MFCC)方法来控制输入语音信号中的时间变化。在这里,SGMM与HMM集成在一起以测量每个状态的效率,这些状态携带一个短窗口帧的信息。为了处理儿童说话者的声音变化,采用了基于声道长度归一化和特征空间最大似然线性回归的说话人自适应训练(SAT)。 Kaldi和开源语音识别工具包用于开发旁遮普儿童语音的鲁棒自动语音识别(ASR)系统。 S GMM累加框架系数及其后验概率,并将这些概率传递给HMM,这些概率系统地适合框架并由HMM状态产生。因此,SGMM的成果在旁遮普儿童语音识别中获得了较大的性能优势。通过更改特征尺寸,使用SGMM可以发现单词错误率(WER)的显着减少。通过将特征尺寸更改为12进行测试,开发的儿童ASR系统获得了83.66%的识别精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号