首页> 外文会议>IEEE Latin American Conference on Computational Intelligence >Online pattern recognition for Portuguese phonemes using Multi-layer Perceptron combined with recurrent non-linear autoregressive Neural Networks with exogenous inputs
【24h】

Online pattern recognition for Portuguese phonemes using Multi-layer Perceptron combined with recurrent non-linear autoregressive Neural Networks with exogenous inputs

机译:使用多层感知器结合具有外部输入的递归非线性自回归神经网络的葡萄牙语音素在线模式识别

获取原文

摘要

Off-line pattern recognition in speech signals is a complex task. Yet, this task becomes harder when the recognition result is required online. The present work proposes an online identification of the Portuguese language phonemes using an nonlinear autoregressive model with exogenous inputs, commonly called NARX. The process first extracts the frequency characteristics of the input speech signals and pre-classifies them into one of the ten possible groups of phonemes, as available in the Portuguese language. This pre-classification is done using a multilayer perceptron network (MLP) with a supervised learning. Subsequently, the MLP output vector, together with the vector that carries the input frequencies, feeds a NARX neural network by means of a temporal delay of four times and feed-backward recurrent links that encompass the results of all hidden layers of the network. As a result of this process, the proposed phoneme recognition process improves the accuracy of the online identification of the Portuguese spoken phonemes during a natural conversation.
机译:语音信号中的离线模式识别是一项复杂的任务。然而,当在线需要识别结果时,该任务变得更加困难。本工作提出了使用带有外部输入的非线性自回归模型(通常称为NARX)对葡萄牙语音素进行在线识别的方法。该过程首先提取输入语音信号的频率特性,然后将其预分类为十种可能的音素组之一,这在葡萄牙语中是可用的。这种预分类是使用具有监督学习功能的多层感知器网络(MLP)进行的。随后,MLP输出向量与携带输入频率的向量一起,通过四倍的时间延迟和包含网络所有隐藏层结果的前馈递归链接,为NARX神经网络提供了反馈。作为该过程的结果,提出的音素识别过程提高了自然对话期间葡萄牙语口语音素在线识别的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号