首页> 外文会议>IEEE Workshop on Automatic Speech Recognition and Understanding >Efficient nearly error-less LVCSR decoding based on incremental forward and backward passes
【24h】

Efficient nearly error-less LVCSR decoding based on incremental forward and backward passes

机译:基于增量前向和后向传递的高效几乎无差错的LVCSR解码

获取原文

摘要

We show that most search errors can be identified by aligning the results of a symmetric forward and backward decoding pass. Based on this knowledge, we introduce an efficient high-level decoding architecture which yields virtually no search errors, and requires virtually no manual tuning. We perform an initial forward- and backward decoding with tight initial beams, then we identify search errors, and then we recursively increment the beam sizes and perform new forward and backward decodings for erroneous intervals until no more search errors are detected. Consequently, each utterance and even each single word is decoded with the smallest beam size required to decode it correctly. On all tested systems we achieve an error rate equal or very close to classical decoding with ideally tuned beam size, but unsupervisedly without specific tuning, and at around 2 times faster runtime. An additional speedup by factor 2 can be achieved by decoding the forward and backward pass in separate threads.
机译:我们表明,大多数搜索错误都可以通过对齐对称向前和向后解码遍历的结果来识别。基于此知识,我们介绍了一种高效的高级解码体系结构,该体系结构几乎不会产生搜索错误,并且几乎不需要手动调整。我们使用紧密的初始波束执行初始向前和向后解码,然后确定搜索错误,然后递归地增加波束大小并针对错误的间隔执行新的向前和向后解码,直到检测不到更多搜索错误为止。因此,每个发声甚至每个单词都以正确解码所需的最小波束大小进行解码。在所有经过测试的系统上,我们都可以通过理想的波束大小来实现等于或非常接近经典解码的错误率,但是无需进行专门的调整就可以无监督地运行,并且运行时间快2倍。可以通过在单独的线程中解码前向和后向传递来实现因数2的额外加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号