How to Add Word Classes to the Kaldi Speech Recognition Toolkit

机译：如何将单词类添加到Kaldi语音识别工具包

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The paper explains and illustrates how the concept of word classes can be added to the widely used open-source speech recognition toolkit Kaldi. The suggested extensions to existing Kaldi recipes are limited to the word-level grammar (G) and the pronunciation lexicon (L) models. The implementation to modify the weighted finite state transducers employed in Kaldi makes use of the OpenFST library. In experiments on small and mid-sized corpora with vocabulary sizes of 1.5 K and 5.5 K respectively a slight improvement of the word error rate is observed when the approach is tested with (hand-crafted) word classes. Furthermore it is shown that the introduction of sub-word unit models for open word classes can help to robustly detect and classify out-of-vocabulary words without impairing word recognition accuracy.

机译：本文解释并说明了如何将Word类的概念添加到广泛使用的开源语音识别工具包KALDI中。现有KALDI配方的建议扩展仅限于单词级语法（g）和发音词典（l）模型。修改Kaldi中使用的加权有限状态传感器的实现利用OpenFST库。在具有1.5 k和5.5克的词汇量的小型和中型Corpora的实验中，观察到使用（手工制作）字类的方法进行略微改善字错误率。此外，表明开放字类的子字单元模型的引入可以有助于强大地检测和分类失入失败的单词，而不会损害字识别精度。

著录项

来源
《International Conference on Text, Speech and Dialogue》|2016年|550p|共9页
会议地点
作者
Axel Horndasch; Caroline Kaufhold; Elmar Noth;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.1-53;
关键词
Word classes; Kaldi speech recognition toolkit; OOV detection and classification;

机译：Word类;Kaldi语音识别工具包;OOV检测和分类;

相似文献

外文文献
中文文献
专利

1. DNN based continuous speech recognition system of Punjabi language on Kaldi toolkit [J] . Jyoti Guglani, A. N. Mishra International journal of speech technology . 2021,第1期

机译：基于DNN基于Kaldi Toolkit的Punjabi语言的连续语音识别系统
2. Automatic speech recognition system with pitch dependent features for Punjabi language on KALDI toolkit [J] . Guglani Jyoti, Mishra A. N. Applied Acoustics . 2020,第Octa期

机译：在Kaldi Toolkit上的Punjabi语言具有音调依赖功能的自动语音识别系统
3. Continuous Punjabi speech recognition model based on Kaldi ASR toolkit [J] . Jyoti Guglani, A. N. Mishra International journal of speech technology . 2018,第2期

机译：基于Kaldi ASR工具包的旁遮普语连续语音识别模型
4. How to Add Word Classes to the Kaldi Speech Recognition Toolkit [C] . Axel Horndasch, Caroline Kaufhold, Elmar Noeth International conference on text, speech and dialogue . 2016

机译：如何将单词类添加到Kaldi语音识别工具包
5. Learning Out-of-Vocabulary Words in Automatic Speech Recognition. [D] . Qin, Long. 2013

机译：在自动语音识别中学习词汇外单词。
6. Measuring open-set word recognition in school-aged children: Corpus of monosyllabic target words and speech maskers [O] . Angela Yarnell Bonino, Ashley R. Malley -1

机译：测量学龄儿童的开放式单词识别：单音节目标单词和语音掩盖语的语料库
7. Learning to Count Words in Fluent Speech Enables Online Speech Recognition [O] . George Sterpu, Christian Saam, Naomi Harte 2021

机译：学习在流利的演讲中计算单词，可以在线演讲识别

How to Add Word Classes to the Kaldi Speech Recognition Toolkit

摘要

著录项

相似文献

相关主题

期刊订阅