首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Feature combination and stacking of recurrent and non-recurrent neural networks for LVCSR
【24h】

Feature combination and stacking of recurrent and non-recurrent neural networks for LVCSR

机译:LVCSR的递归和非递归神经网络的特征组合和堆叠

获取原文

摘要

This paper investigates the combination of different short-term features and the combination of recurrent and non-recurrent neural networks (NNs) on a Spanish speech recognition task. Several methods exist to combine different feature sets such as concatenation or linear discriminant analysis (LDA). Even though all these techniques achieve reasonable improvements, feature combination by multi-layer perceptrons (MLPs) outperforms all known approaches. We develop the concept of MLP based feature combination further using recurrent neural networks (RNNs). The phoneme posterior estimates derived from an RNN lead to a significant improvement over the result of the MLPs and achieve a 5% relative better word error rate (WER) with much less parameters.
机译:本文研究了西班牙语音识别任务中不同短期特征的组合以及递归和非递归神经网络(NNs)的组合。存在几种组合不同特征集的方法,例如串联或线性判别分析(LDA)。即使所有这些技术均取得了合理的改进,但多层感知器(MLP)进行的特征组合仍胜过所有已知方法。我们进一步使用递归神经网络(RNN)开发了基于MLP的特征组合的概念。从RNN得出的音素后验估计比MLP的结果有显着改善,并且参数少得多,相对误码率(WER)达到5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号