Automatic Spotting of Vowels, Nasals and Approximants from Speech Signals

机译：自动识别语音信号中的元音，鼻音和近似词

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic speech recognition involves methodologies for translation of spoken language into text. An important problem that needs to be solved for the success of speech recognition is the accurate detection of phonemes. In this paper, a two stage system for spotting the boundaries of vowels, nasals and approximants in Malayalam speech signal is proposed. In the first stage, speech signal is classified into six broad phoneme classes using an Artificial Neural Network based broad phoneme classifier. Classifier with nine features has limited accuracy for detecting vowel, nasal and approximant boundaries. So features like difference of spectral spread, spectral centroid, envelope variance, energy ratio, difference in formant frequencies are added to the classifier. With these additional features, a major improvement in classifier accuracy is achieved. In the second stage, a frequency domain parameter named spectral peak frequency is suggested for accurate verification of nasals. Sonorant and nonsyllabic features are used for verifying approximants and syllabic feature is used for locating vowels.

机译：自动语音识别涉及将口语翻译成文本的方法。语音识别成功需要解决的一个重要问题是音素的准确检测。本文提出了一种用于识别马拉雅拉姆语语音信号中元音，鼻音和近似词边界的两阶段系统。在第一阶段，使用基于人工神经网络的广义音素分类器将语音信号分为六个广义音素类别。具有九种功能的分类器在检测元音，鼻音和近似边界时准确性有限。因此，将诸如频谱扩展差，频谱质心，包络方差，能量比，共振峰频率差之类的特征添加到分类器中。使用这些附加功能，可以大大提高分类器的准确性。在第二阶段中，建议使用一个名为频谱峰值频率的频域参数来精确验证鼻腔。使用Sonorant和非音节特征来验证近似值，并使用音节特征来定位元音。

著录项

来源
《International CET Conference on Control, Communication, and Computing》|2018年|272-277|共6页
会议地点 Trivandrum(IN)
作者
Shinimol Salim; G Deekshitha; Anu George; Leena Mary;
展开▼
作者单位

Dept of ECE Rajiv Gandhi Institute of Technology Kottayam Kerala India;

Dept of ECE Government Engineering College Idukki Kerala India;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Frequency measurement; Feature extraction; Neural networks; Energy measurement; Speech recognition; Speech processing; Training;

机译：频率测量；特征提取;神经网络;能量测量；语音识别;语音处理；训练;

相似文献

外文文献
中文文献
专利

1. Automatic syllabification of speech signal using short time energy and vowel onset points [J] . Leena Mary, Anil P. Antony, Ben P. Babu, International journal of speech technology . 2018,第3期

机译：使用短时能量和元音起始点自动语音信号音节化
2. Identification of vowels in consonant-vowel-consonant words from speech imagery based EEG signals [J] . Chengaiyan Sandhya, Retnapandian Anandha Sree, Anandan Kavitha Cognitive Neurodynamics . 2020,第1期

机译：基于语音图像的EEG信号辨识辅音元音辅音词元音
3. Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points [J] . Anil Kumar Vuppala, K. Sreenivasa Rao, Saswat Chakrabarti Circuits, systems, and signal processing . 2012,第4期

机译：利用元音起始点的精确检测从连续语音中识别和识别辅音元音单元
4. Automatic Spotting of Vowels, Nasals and Approximants from Speech Signals [C] . Shinimol Salim, G Deekshitha, Anu George, International CET Conference on Control, Communication, and Computing . 2018

机译：从语音信号自动发现元音，鼻腔和近似值
5. Exploration of Acoustic Features for Automatic Vowel Discrimination in Spontaneous Speech. [D] . Tyson, Na'im R. 2012

机译：探索自发语音中元音自动识别的声学特性。
6. Identification of vowels in consonant–vowel–consonant words from speech imagery based EEG signals [O] . Sandhya Chengaiyan, Anandha Sree Retnapandian, Kavitha Anandan 2020

机译：基于语音图像的EEG信号辨识辅音元音辅音词元音
7. Reliability of rating synthesized hypernasal speech signals in connected speech and vowels [O] . Wong Chun-ho Eddy 2007

机译：在连接的语音和元音中评定合成的过耳语音信号的可靠性
8. Speech Recognition System Test. Identification of Vowels Excerpted from Oral and Nasal Contexts. [R] . bond, z. s. 1976

机译：语音识别系统测试。从口头和鼻上下文中摘录的元音识别。

Automatic Spotting of Vowels, Nasals and Approximants from Speech Signals

摘要

著录项

相似文献

相关主题

期刊订阅