【24h】

Continuous Digits Recognition Leveraging Invariant Structure

机译:利用不变结构的连续数字识别

获取原文

摘要

Recently, an invariant structure of speech was proposed, where the inevitable acoustic variations caused by non-linguistic factors are effectively removed from speech. The invariant structure was applied to isolated word recognition and the experimental results showed good performance. However, the previous method can't apply to continuous speech recognition directly because there was no efficient decoding algorithm. In this paper, we propose a method to leverage the invariant structure in continuous digits recognition. We use a traditional HMM-based Automatic Speech Recognition (ASR) system to get TV-best lists with phone alignments. Then we construct invariant structures using these phone alignments and re-rank the N-best lists by investigating which hypothesis is structurally more valid. Experimental results show a relative WER improvement of 17.4% over the baseline HMM-based ASR system.
机译:最近,提出了语音的不变结构,其中有效地消除了由非语言因素引起的不可避免的声音变化。将不变结构应用于孤立词识别,实验结果表明该算法具有良好的性能。但是,由于没有高效的解码算法,因此先前的方法无法直接应用于连续语音识别。在本文中,我们提出了一种在连续数字识别中利用不变结构的方法。我们使用传统的基于HMM的自动语音识别(ASR)系统来获取具有电话对齐功能的电视最佳列表。然后,我们使用这些电话比对来构造不变结构,并通过调查哪种假设在结构上更有效来对N-best列表进行重新排序。实验结果表明,相对于基于基线HMM的ASR系统,WER相对提高了17.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号