Automatic Spotting of Vowels, Nasals and Approximants from Speech Signals

Abstract

Automatic speech recognition translates spoken language into text, and its success depends on accurate detection of phonemes. This paper proposes a two-stage system for spotting the boundaries of vowels, nasals, and approximants in Malayalam speech signals. In the first stage, the speech signal is classified into six broad phoneme classes using a broad phoneme classifier based on an artificial neural network. A classifier with nine features has limited accuracy in detecting vowel, nasal, and approximant boundaries, so additional features are added: difference of spectral spread, spectral centroid, envelope variance, energy ratio, and difference in formant frequencies. With these additional features, classifier accuracy improves substantially. In the second stage, a frequency-domain parameter named spectral peak frequency is suggested for accurate verification of nasals. Sonorant and nonsyllabic features are used to verify approximants, and a syllabic feature is used to locate vowels.
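The abstract names several frame-level spectral measurements without defining them. As a rough illustration only, the sketch below computes three of them (spectral centroid, spectral spread, and spectral peak frequency) for a single frame using common textbook definitions. The function names, the periodic Hann window, the frame length, and the 250 Hz test tone are all assumptions made for this example; the paper's own definitions, frame settings, and decision thresholds are not given in the abstract and may differ.

```python
import cmath
import math


def magnitude_spectrum(frame, sample_rate):
    """Return (frequencies_hz, magnitudes) for the non-negative DFT bins."""
    n = len(frame)
    # Periodic Hann window (an assumption) to limit spectral leakage.
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
                for i, x in enumerate(frame)]
    freqs, mags = [], []
    for k in range(n // 2 + 1):
        # Naive DFT: adequate for one short frame; use an FFT for real data.
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(windowed))
        freqs.append(k * sample_rate / n)
        mags.append(abs(acc))
    return freqs, mags


def spectral_features(frame, sample_rate):
    """Compute centroid, spread, and peak frequency (all in Hz) of one frame."""
    freqs, mags = magnitude_spectrum(frame, sample_rate)
    total = sum(mags)
    if total == 0.0:
        # Silent frame: no spectral mass, so report zeros.
        return {"centroid": 0.0, "spread": 0.0, "peak_frequency": 0.0}
    # Centroid: magnitude-weighted mean frequency.
    centroid = sum(f * m for f, m in zip(freqs, mags)) / total
    # Spread: magnitude-weighted standard deviation around the centroid.
    spread = math.sqrt(
        sum(((f - centroid) ** 2) * m for f, m in zip(freqs, mags)) / total
    )
    # Peak frequency: frequency of the bin with the largest magnitude.
    peak_index = max(range(len(mags)), key=mags.__getitem__)
    return {
        "centroid": centroid,
        "spread": spread,
        "peak_frequency": freqs[peak_index],
    }


if __name__ == "__main__":
    sample_rate = 8000   # Hz, hypothetical
    frame_length = 256   # samples, hypothetical
    # A synthetic low-frequency tone standing in for one speech frame.
    tone = [math.sin(2 * math.pi * 250.0 * i / sample_rate)
            for i in range(frame_length)]
    print(spectral_features(tone, sample_rate))
```

In a two-stage system like the one described, values of this kind would be computed per frame and passed to the classifier or to a verification rule; how the paper combines or thresholds them is not stated in the abstract.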

Bibliographic details

Source: International CET Conference on Control, Communication, and Computing | 2018 | 446p | 6 pages in total
Authors: Shinimol Salim; G Deekshitha; Anu George; Leena Mary
Original format: PDF
Chinese Library Classification: TP-53
Keywords: Frequency measurement; Feature extraction; Neural networks; Energy measurement; Speech recognition; Speech processing; Training

Similar literature

1. Automatic syllabification of speech signal using short time energy and vowel onset points [J]. Leena Mary, Anil P. Antony, Ben P. Babu. International Journal of Speech Technology. 2018, No. 3
2. Identification of vowels in consonant-vowel-consonant words from speech imagery based EEG signals [J]. Chengaiyan Sandhya, Retnapandian Anandha Sree, Anandan Kavitha. Cognitive Neurodynamics. 2020, No. 1
3. Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points [J]. Anil Kumar Vuppala, K. Sreenivasa Rao, Saswat Chakrabarti. Circuits, Systems, and Signal Processing. 2012, No. 4
4. Automatic Spotting of Vowels, Nasals and Approximants from Speech Signals [C]. Shinimol Salim, G Deekshitha, Anu George. International CET Conference on Control, Communication, and Computing. 2018
5. Exploration of Acoustic Features for Automatic Vowel Discrimination in Spontaneous Speech [D]. Tyson, Na'im R. 2012
6. Identification of vowels in consonant–vowel–consonant words from speech imagery based EEG signals [O]. Sandhya Chengaiyan, Anandha Sree Retnapandian, Kavitha Anandan. 2020
7. Reliability of rating synthesized hypernasal speech signals in connected speech and vowels [O]. Wong Chun-ho Eddy. 2007
8. Speech Recognition System Test. Identification of Vowels Excerpted from Oral and Nasal Contexts [R]. Bond, Z. S. 1976
