首页> 外文会议>International CET Conference on Control, Communication, and Computing >Automatic Spotting of Vowels, Nasals and Approximants from Speech Signals
【24h】

Automatic Spotting of Vowels, Nasals and Approximants from Speech Signals

机译:自动识别语音信号中的元音,鼻音和近似词

获取原文

摘要

Automatic speech recognition involves methodologies for translation of spoken language into text. An important problem that needs to be solved for the success of speech recognition is the accurate detection of phonemes. In this paper, a two stage system for spotting the boundaries of vowels, nasals and approximants in Malayalam speech signal is proposed. In the first stage, speech signal is classified into six broad phoneme classes using an Artificial Neural Network based broad phoneme classifier. Classifier with nine features has limited accuracy for detecting vowel, nasal and approximant boundaries. So features like difference of spectral spread, spectral centroid, envelope variance, energy ratio, difference in formant frequencies are added to the classifier. With these additional features, a major improvement in classifier accuracy is achieved. In the second stage, a frequency domain parameter named spectral peak frequency is suggested for accurate verification of nasals. Sonorant and nonsyllabic features are used for verifying approximants and syllabic feature is used for locating vowels.
机译:自动语音识别涉及将口语翻译成文本的方法。语音识别成功需要解决的一个重要问题是音素的准确检测。本文提出了一种用于识别马拉雅拉姆语语音信号中元音,鼻音和近似词边界的两阶段系统。在第一阶段,使用基于人工神经网络的广义音素分类器将语音信号分为六个广义音素类别。具有九种功能的分类器在检测元音,鼻音和近似边界时准确性有限。因此,将诸如频谱扩展差,频谱质心,包络方差,能量比,共振峰频率差之类的特征添加到分类器中。使用这些附加功能,可以大大提高分类器的准确性。在第二阶段中,建议使用一个名为频谱峰值频率的频域参数来精确验证鼻腔。使用Sonorant和非音节特征来验证近似值,并使用音节特征来定位元音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号