首页> 外文会议>International conference on advances in computing, communications and informatics >Optimizing feature extraction techniques constituting phone based modelling on connected words for Punjabi automatic speech recognition
【24h】

Optimizing feature extraction techniques constituting phone based modelling on connected words for Punjabi automatic speech recognition

机译:基于旁联词的电话建模的特征提取技术优化,用于旁遮普语自动语音识别

获取原文

摘要

Punjabi phoneme sounds are tonal in nature which dissent in most regions of Punjab. Recent research works reveal less significant work done towards developing a speech recognition system in Punjabi. The work done will feature out variability in the correctness and accuracy of various feature extraction techniques. Following paper objects the application of Automatic Speech Recognition on connected words instituting HTK toolkit modelled on Hidden Markov Model (HMM) to build the system. Back-end of the system was braced for 150 distinct Punjabi words from 16 distinct speakers from noise-free corpus and 12 speakers were indulged for the collection of noisy corpus including both male and female. In the phrase of speech recognition, the proposed Feature Extractor we use Front end techniques as “power normalized cepstral coefficients (PNCC)”, “Mel frequency cepstral coefficients (MFCC)” and “Perceptual Linear Prediction (PLP)” following a statistical comparison based on the accuracy and correctness of results attained. To attain a higher rate of accuracy level 34 phones for Punjabi language are used to break each word into small sound frames. Hence, the comparison based on the nature of training and testing environment will aid in framing a vital speech recognition system for Punjabi language.
机译:旁遮普语的音素本质上是音调,在旁遮普的大多数地区都不同。最近的研究工作表明,在旁遮普开发语音识别系统方面所做的工作不那么重要。完成的工作将以各种特征提取技术的正确性和准确性为特征。接下来的论文针对自动语音识别在关联单词上的应用,建立了以隐马尔可夫模型(HMM)为模型的HTK工具包来构建系统。该系统的后端支持来自16个来自无噪声语料库的不同说话者的150个不同的旁遮普语,并且沉迷于12个说话者收集包括男性和女性在内的嘈杂语料库。在语音识别的过程中,基于统计比较,建议的特征提取器使用前端技术作为“功率归一化倒谱系数(PNCC)”,“梅尔频率倒谱系数(MFCC)”和“感知线性预测(PLP)”结果的准确性和正确性。为了获得更高的准确性,使用34种支持旁遮普语的电话将每个单词分解为较小的声音帧。因此,基于培训和测试环境的性质进行的比较将有助于为旁遮普语构建重要的语音识别系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号