Signal & Image Processing : An International Journal (SIPIJ)

Formant Analysis of Bangla Vowel for Automatic Speech Recognition

Abstract

Regional and local language recognition is drawing researchers' attention as a way to bring new technological benefits to the general public. As with other languages, a Bangla speech recognition scheme is in demand. A formant is a resonance frequency of the vocal tract. Formant frequencies play an important role in automatic speech recognition because of their robustness to noise. In this paper, Bangla vowels are investigated to obtain formant frequencies and their corresponding bandwidths from continuous Bangla sentences; these are considered potential parameters for a wide range of voice applications. For the formant analysis, cepstrum-based formant estimation and Linear Predictive Coding (LPC) techniques are used. To acquire the formant characteristics, the rich continuous sentences of the widely available Bangla language corpus "SHRUTI" are used. Intensive experimentation is carried out to determine the formant characteristics (frequency and bandwidth) of Bangla vowels for both male and female speakers. Finally, the vowel recognition accuracy for Bangla is reported using the first three formants.
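As an illustration of the LPC route mentioned in the abstract, the following is a minimal sketch of how formant frequencies and bandwidths can be read off the poles of an LPC model. It is not the authors' implementation. The function names (`lpc_autocorr`, `poles_to_formants`, `estimate_formants`), the pre-emphasis factor of 0.97, the Hamming window, and the model-order rule of thumb are all assumptions chosen for the example. A pole at radius r and angle θ is mapped to a frequency θ·fs/(2π) and a bandwidth −(fs/π)·ln r.

```python
import numpy as np


def lpc_autocorr(x, order):
    """LPC coefficients [1, a1, ..., ap] via the autocorrelation method.

    Solves the Yule-Walker equations with the Levinson-Durbin recursion.
    Assumes x is not all zeros.
    """
    x = np.asarray(x, dtype=float)
    n = len(x)
    # Autocorrelation at lags 0..order
    r = np.correlate(x, x, mode="full")[n - 1 : n + order]
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1 : 0 : -1])
        k = -acc / err  # reflection coefficient
        prev = a.copy()
        a[1:i] = prev[1:i] + k * prev[i - 1 : 0 : -1]
        a[i] = k
        err *= 1.0 - k * k
    return a


def poles_to_formants(a, fs):
    """Convert LPC polynomial roots to (frequencies, bandwidths) in Hz."""
    roots = np.roots(a)
    roots = roots[np.imag(roots) > 0]  # one root per conjugate pair
    freqs = np.angle(roots) * fs / (2.0 * np.pi)
    bws = -np.log(np.abs(roots)) * fs / np.pi
    idx = np.argsort(freqs)
    return freqs[idx], bws[idx]


def estimate_formants(frame, fs, order=None, preemph=0.97):
    """Formant candidates for one vowel frame (hypothetical helper)."""
    frame = np.asarray(frame, dtype=float)
    if order is None:
        order = 2 + int(fs / 1000)  # common rule of thumb, an assumption here
    # Pre-emphasis followed by a Hamming window
    frame = np.append(frame[0], frame[1:] - preemph * frame[:-1])
    frame = frame * np.hamming(len(frame))
    return poles_to_formants(lpc_autocorr(frame, order), fs)
```

In practice the lowest three returned frequencies would be taken as candidates for the first three formants, usually after discarding poles with very large bandwidths. Any such threshold would be a further assumption and is not stated in the abstract.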
