【24h】

Acoustic Look-Ahead for More Efficient Decoding in LVCSR

机译:在LVCSR中进行声学前瞻以实现更高效的解码

获取原文

摘要

In this paper we propose novel approximations of a generalized acoustic look-ahead to speed up the search process in large vocabulary continuous speech recognition (LVCSR). Unlike earlier methods, we do not employ any phoneme- or syllable level heuristics. First we define and analyze the perfect acoustic look-ahead as a simple pre-evaluation of the original acoustic models into the future. This method is very slow, but reveals the best possible impact on the search space that can be achieved through acoustic look-ahead. In a second step, we derive efficient and simple approximative look-ahead models from the perfect models. We show that the approximative models compare well to the perfect models regarding the search space, and that the approximative models significantly improve the efficiency in comparison to the baseline, without any negative effect on the precision.
机译:在本文中,我们提出了一种通用的声学预见的新颖近似方法,以加快大词汇量连续语音识别(LVCSR)的搜索过程。与以前的方法不同,我们不使用任何音素或音节水平试探法。首先,我们定义并分析理想的声学前瞻,作为对未来声学模型的简单预评估。此方法非常慢,但是可以通过声学预视来获得对搜索空间的最佳影响。第二步,我们从完美模型中得出有效且简单的近似预见模型。我们表明,对于搜索空间,逼近模型与完美模型具有很好的比较,并且与基线相比,逼近模型显着提高了效率,而对精度没有任何负面影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号