...
首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Large vocabulary continuous speech recognition using N-best linear lexicon search and tree lexicon search with 1-best approximation
【24h】

Large vocabulary continuous speech recognition using N-best linear lexicon search and tree lexicon search with 1-best approximation

机译:使用N-Best Linear Lexicon搜索和Tree Lexicon搜索的大词汇连续语音识别,使用1-最近似

获取原文
获取原文并翻译 | 示例
           

摘要

Computational cost on a large vocabulary continuous speech recognition system based on HMM is proportional to the number of words in the vocabulary. A tree-structured dictionary is generally used to reduce the number of states of HMMs. An approximation of dependency of word boundary and likelihood on word histories is also used to suppress the increase of hypotheses in the forward procedure. We first compared the search algorithms with a tree-structured dictionary using some approximation methods and that with a linear dictionary. The algorithm based on 1-best approximation with a tree-structured dictionary is efficient but frequently looses the optimal sentence hypothesis. Linear dictionary search can find the optimal hypothesis but needs much computational cost. Thus, we propose a search method using these two algorithms in parallel. We evaluated this new search algorithm and obtained improved word recognition rate and word accuracy by 5% and 3%, respectively on read speech, and 2% and 3%, respectively on broadcast news speech.
机译:基于HMM的大词汇连续语音识别系统的计算成本与词汇表中的单词数量成比例。树结构字典通常用于减少HMMS状态的数量。字边界依赖性的近似值和字历史上的似然性也用于抑制前向过程中的假设的增加。我们首先使用一些近似方法和线性词典将搜索算法与树结构型词典进行比较。基于1-最佳近似与树结构字典的算法是有效的,但经常丢失最佳句子假设。线性词典搜索可以找到最佳假设,但需要大量的计算成本。因此,我们提出了一种并行使用这两个算法的搜索方法。我们评估了这种新的搜索算法,分别在读语音上分别获得了5%和3%的文字识别率和字精度,分别在广播新闻语音上分别为2%和3%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号