首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >DEEP HIERARCHICAL BOTTLENECK MRASTA FEATURES FOR LVCSR
【24h】

DEEP HIERARCHICAL BOTTLENECK MRASTA FEATURES FOR LVCSR

机译:深层等级瓶颈MRASTA LVCSR的功能

获取原文

摘要

Hierarchical Multi Layer Perceptron (MLP) based long-term feature extraction is optimized for TANDEM connectionist large vocabulary continuous speech recognition (LVCSR) system within the QUAERO project. Training the bottleneck MLP on multi-resolutional RASTA filtered critical band energies, more than 20% relative word error rate (WER) reduction over standard MFCC system is observed after optimizing the number of target labels. Furthermore, introducing a deeper structure in the hierarchical bottleneck processing the relative gain increases to 25%. The final system based on deep bottleneck TANDEM features clearly outperforms the hybrid approach, even if the long-term features are also presented to the deep MLP acoustic model. The results are also verified on evaluation data of the year 2012, and about 20% relative WER improvement over classical cepstral system is measured even after speaker adaptive training.
机译:基于分层的多层Perceptron(MLP)的长期特征提取是针对Quaero项目中的串联连接师大词汇表连续语音识别(LVCSR)系统进行了优化。在优化目标标签的数量之后,观察到培训多分辨率Rasta过滤的关键频带能量上的瓶颈MLP,在标准MFCC系统上观察到超过20%的相对字错误率(WER)。此外,在处理相对增益的分层瓶颈中引入更深的结构,相对增益增加到25%。基于深层瓶颈的最终系统串联特征显然优于混合方法,即使也呈现给深MLP声学模型也是如此。结果还验证了2012年的评估数据,即使在演讲者自适应培训之后,均衡了古典颅骨系统的约20%的相对行动改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号