首页> 外文会议>Workshop on Automatic Speech Recognition and Understanding >DEEP MAXOUT NETWORKS FOR LOW-RESOURCE SPEECH RECOGNITION
【24h】

DEEP MAXOUT NETWORKS FOR LOW-RESOURCE SPEECH RECOGNITION

机译:低资源语音识别的深颤音网络

获取原文

摘要

As a feed-forward architecture, the recently proposed maxout networks integrate dropout naturally and show state-of-the-art results on various computer vision datasets. This paper investigates the application of deep maxout networks (DMNs) to large vocabulary continuous speech recognition (LVCSR) tasks. Our focus is on the particular advantage of DMNs under low-resource conditions with limited transcribed speech. We extend DMNs to hybrid and bottleneck feature systems, and explore optimal network structures (number of maxout layers, pooling strategy, etc) for both setups. On the newly released Babel corpus, behaviors of DMNs are extensively studied under different levels of data availability. Experiments show that DMNs improve low-resource speech recognition significantly. Moreover, DMNs introduce sparsity to their hidden activations and thus can act as sparse feature extractors.
机译:作为前馈架构,最近提出的Maxout网络自然集成了丢失并显示了各种计算机视觉数据集的最先进的结果。本文调查了深度磁最大网络(DMNS)对大型词汇连续语音识别(LVCSR)任务的应用。我们的重点是DMN在低资源条件下的特定优势,有限转录的讲话。我们将DMNS扩展到混合动力和瓶颈功能系统,并探索两个设置的最佳网络结构(磁最大层数,池汇率策略等)。在新发布的Babel语料库上,在不同的数据可用性水平下广泛研究DMN的行为。实验表明,DMN显着提高低资源语音识别。此外,DMNS将稀疏性引入其隐藏的激活,因此可以充当稀疏特征提取器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号