International Conference on Text, Speech and Dialogue

How to Add Word Classes to the Kaldi Speech Recognition Toolkit

Abstract

The paper explains and illustrates how the concept of word classes can be added to the widely used open-source speech recognition toolkit Kaldi. The suggested extensions to existing Kaldi recipes are limited to the word-level grammar (G) and the pronunciation lexicon (L) models. The implementation, which modifies the weighted finite-state transducers employed in Kaldi, makes use of the OpenFst library. In experiments on small and mid-sized corpora with vocabulary sizes of 1.5 K and 5.5 K, respectively, a slight improvement in word error rate is observed when the approach is tested with (hand-crafted) word classes. Furthermore, it is shown that the introduction of sub-word unit models for open word classes can help to robustly detect and classify out-of-vocabulary words without impairing word recognition accuracy.
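To make the idea of word classes in the grammar (G) concrete, here is a minimal sketch in plain Python. It is not the authors' implementation and does not use OpenFst or Kaldi. It assumes the usual class-based language model factorisation, P(word | history) = P(class | history) · P(word | class), and treats G as a simple list of weighted arcs whose weights are negative log probabilities. The `Arc` type, the `expand_classes` helper, the `<CITY>` class token, and the toy probabilities are all hypothetical names and values chosen for illustration.

```python
import math
from typing import NamedTuple


class Arc(NamedTuple):
    src: int        # source state
    dst: int        # destination state
    ilabel: str     # input label
    olabel: str     # output label
    weight: float   # cost = negative log probability


def expand_classes(arcs, class_members):
    """Replace every arc labelled with a class token by one arc per member word.

    class_members maps a class token to {word: P(word | class)}.
    """
    expanded = []
    for arc in arcs:
        members = class_members.get(arc.ilabel)
        if members is None:
            expanded.append(arc)  # ordinary word arc: keep unchanged
            continue
        for word, prob in members.items():
            # cost(word) = cost(class | history) + cost(word | class)
            cost = arc.weight - math.log(prob)
            expanded.append(Arc(arc.src, arc.dst, word, word, cost))
    return expanded


# Hypothetical toy grammar accepting "fly to <CITY>".
grammar = [
    Arc(0, 1, "fly", "fly", 0.0),
    Arc(1, 2, "to", "to", 0.0),
    Arc(2, 3, "<CITY>", "<CITY>", -math.log(0.5)),
]
# Hypothetical hand-crafted class with member probabilities.
classes = {"<CITY>": {"prague": 0.75, "brno": 0.25}}

expanded = expand_classes(grammar, classes)
for arc in expanded:
    print(arc)
```

In this toy setting the single `<CITY>` arc becomes two parallel word arcs, so adding a new city only requires editing the class membership table rather than re-estimating the word-level grammar. A real system would also have to add the member pronunciations to the lexicon (L), which this sketch leaves out.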