...
首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR
【24h】

Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR

机译:用于ASR的基于平稳起始的单阶段判别训练的HMM模型

获取原文
获取原文并翻译 | 示例
           

摘要

In recent years, end-to-end approaches to automatic speech recognition have received considerable attention as they are much faster in terms of preparing resources. However, conventional multistage approaches, which rely on a pipeline of training hidden Markov models (HMM)-GMM models and tree-building steps still give the state-of-the-art results on most databases. In this study, we investigate flat-start one-stage training of neural networks using lattice-free maximum mutual information (LF-MMI) objective function with HMM for large vocabulary continuous speech recognition. We thoroughly look into different issues that arise in such a setup and propose a standalone system, which achieves word error rates (WER) comparable with that of the state-of-the-art multi-stage systems while being much faster to prepare. We propose to use full biphones to enable flat-start context-dependent (CD) modeling and show through experiments that our CD modeling approach can be almost as effective as regular tree-based CD modeling. We show that our flat-start LF-MMI setup together with this tree-free CD modeling technique achieves 10 to 25 % relative WER reduction compared to other end-to-end methods on well-known databases. The improvements are larger for smaller databases.
机译:近年来,端到端的自动语音识别方法已经备受关注,因为它们在准备资源方面要快得多。但是,传统的多阶段方法依靠训练隐马尔可夫模型(HMM)-GMM模型和树构建步骤的流水线,仍然可以为大多数数据库提供最新的结果。在这项研究中,我们研究了使用无格最大互信息(LF-MMI)目标函数和HMM进行大词汇量连续语音识别的神经网络的扁平化第一步训练。我们彻底研究了这种设置中出现的不同问题,并提出了一个独立的系统,该系统可实现与最新的多级系统相媲美的字错误率(WER),并且准备速度更快。我们建议使用完整的Biphone来实现扁平启动上下文相关(CD)建模,并通过实验证明我们的CD建模方法几乎可以与常规的基于树的CD建模一样有效。我们证明,与知名数据库上的其他端到端方法相比,我们的平稳启动LF-MMI设置与这种无树CD建模技术相比,可实现10%到25%的相对WER降低。对于较小的数据库,改进较大。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号