首页> 美国政府科技报告 >Speech Coding and Phoneme Classification Using a Back-Propagation Neural Network

【24h】

Speech Coding and Phoneme Classification Using a Back-Propagation Neural Network

机译：使用反向传播神经网络的语音编码和音素分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech is a natural, unspecialized method of communication that is perhaps the ultimate machine interface. Previous attempts to provide such an interface, however, have been limited to pre-defined vocabularies of an artificial syntax. This paper presents a method for speaker-dependent speech identification that uses a back-propagation neural network to determine the phonemes present within a voice signal. The vocal tract changes slowly in time and can be modeled using the pitch and formant frequencies of the voice. These frequencies relate the position of the vocal tract to specific pronunciations and are obtained by using a homomorphic filtering process that separates the vocal tract's impulse response from the excitation source. The frequency representation of this response is concatenated with the excitation containing the pitch frequency and applied to the input layer of the neural network. From this information, the network selects combinations of features that identify the phonemes which are present. This network was trained on a set of speaker dependent phonemes, and now phonetically classifies new speech input. This classification scheme could be used to translate linguistic messages into machine code with a very high data rate. This benefit would allow for real-time interaction with machines with no specialized training. Applications could be as simple as providing quick voice to text processing or as diverse as implementing a control system with response time tied to specified voice patterns.

著录项

作者
St. George, B. A.;
展开▼
作者单位

展开▼
年度 1997
页码 1-78
总页数 78
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Neural nets; Speech; Coding; Speech recognition; Voice communications; Phonemes;

机译：神经网络;语音;编码;语音识别;语音通信;音素;

相似文献

外文文献
中文文献
专利

1. Deep neural network-based phoneme classification of standard Khasi dialect in continuous speech [J] . Bronson Syiem, L. Joyprakash Singh International Journal of Applied Pattern Recognition . 2019,第1期

机译：连续语音中基于深度神经网络的标准Khasi方言音素分类
2. Encoding Detection and Bit Rate Classification of AMR-Coded Speech Based on Deep Neural Network [J] . Seong-Hyeon SHIN, Woo-Jin JANG, Ho-Won YUN, IEICE transactions on information and systems . 2018,第1期

机译：基于深度神经网络的AMR编码语音的编码检测和比特率分类
3. Speech Recognition System Based On Phonemes Using Neural Networks [J] . N.Uma Maheswari, A.P.Kabilan, R.Venkatesh International journal of computer science and network security . 2009,第7期

机译：基于神经网络的音素语音识别系统
4. The Comparison and Combination of Genetic and Gradient Descent Learning in Recurrent Neural Networks: An Application to Speech Phoneme Classification [C] . Rohitash Chandra, Christian W. Omlin International Conference on Artificial Intelligence and Pattern Recognition . 2010

机译：经常性神经网络中遗传和梯度下降学习的比较与结合：语音音素分类的应用
5. An application of Artificial Neural Networks to forecast winning price: A comparison of Back-Propagation Networks and Adaptive Network-Based Fuzzy Inference Systems. [D] . Lin, Yingchih. 2010

机译：人工神经网络在中奖价格预测中的应用：反向传播网络与基于自适应网络的模糊推理系统的比较。
6. Back-propagation and counter-propagation neural networks for phylogenetic classification of ribosomal RNA sequences. [O] . C Wu, S Shivakumar 1994

机译：反向传播和反向传播神经网络用于核糖体RNA序列的系统发育分类。
7. Segment phoneme classification from speech under noisy conditions: Using amplitude-frequency modulation based two-dimensional auto-regressive features with deep neural networks [O] . Rangslang Rijuban 2016

机译：在嘈杂条件下从语音中对语音进行音素分类：使用基于深度神经网络的基于振幅-频率调制的二维自回归特征

Speech Coding and Phoneme Classification Using a Back-Propagation Neural Network

摘要

著录项

相似文献

相关主题

期刊订阅