首页> 外文会议>International CET Conference on Control, Communication, and Computing >Automatic Spotting of Vowels, Nasals and Approximants from Speech Signals
【24h】

Automatic Spotting of Vowels, Nasals and Approximants from Speech Signals

机译:从语音信号自动发现元音,鼻腔和近似值

获取原文

摘要

Automatic speech recognition involves methodologies for translation of spoken language into text. An important problem that needs to be solved for the success of speech recognition is the accurate detection of phonemes. In this paper, a two stage system for spotting the boundaries of vowels, nasals and approximants in Malayalam speech signal is proposed. In the first stage, speech signal is classified into six broad phoneme classes using an Artificial Neural Network based broad phoneme classifier. Classifier with nine features has limited accuracy for detecting vowel, nasal and approximant boundaries. So features like difference of spectral spread, spectral centroid, envelope variance, energy ratio, difference in formant frequencies are added to the classifier. With these additional features, a major improvement in classifier accuracy is achieved. In the second stage, a frequency domain parameter named spectral peak frequency is suggested for accurate verification of nasals. Sonorant and nonsyllabic features are used for verifying approximants and syllabic feature is used for locating vowels.
机译:自动语音识别涉及将口语翻译成文本的方法。需要解决语音识别成功的重要问题是准确地检测音素。在本文中,提出了一种用于发现Malayalam语音信号中的元音,鼻腔和近似剂的边界的两个阶段系统。在第一阶段,使用基于人工神经网络的宽音素分类器被分类为六个宽音素类。具有九个功能的分类器具有有限的准确性,可检测元音,鼻腔和近似边界。因此,在分类器中添加了频谱扩展,光谱质心,包络方差,能量比,频率差的差异等特征。利用这些附加功能,实现了分类器精度的重大改进。在第二阶段,建议用于准确验证界面的频域参数。 SONORANT和Nonsyllabic特征用于验证近似值和音节特征用于定位元音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号