Character-based units for unlimited vocabulary continuous speech recognition

Abstract

We study character-based language models in a state-of-the-art speech recognition framework. This approach has advantages over both word-based systems and so-called end-to-end ASR systems, which do not have separate acoustic and language models. We describe the modifications needed to build an effective character-based ASR system using the Kaldi toolkit and evaluate models based on words, statistical morphs, and characters for both Finnish and Arabic. The morph-based models yield the best recognition results for both well-resourced and lower-resourced tasks, but the character-based models come close to their performance in the lower-resourced tasks, outperforming the word-based models. Character-based models are especially good at predicting novel word forms that were not seen in the training data. Using character-based neural network language models is computationally efficient and yields a larger gain than in the morph- and word-based systems.
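As a rough illustration of why character units give an unlimited vocabulary, the sketch below splits text into single characters plus a word-boundary token, so that words can be rebuilt from the unit sequence. The `<w>` token, the function names, and the Finnish sample words are assumptions chosen for illustration; the abstract does not state which boundary-marking scheme the authors use.

```python
# Hypothetical word-boundary token; the marking scheme is an assumption.
BOUNDARY = "<w>"


def to_char_units(sentence: str) -> list[str]:
    """Split a sentence into characters, with BOUNDARY between words."""
    units: list[str] = []
    for i, word in enumerate(sentence.split()):
        if i > 0:
            units.append(BOUNDARY)
        units.extend(word)  # extending a list with a str adds one char at a time
    return units


def from_char_units(units: list[str]) -> str:
    """Rebuild the word sequence from character units."""
    words: list[str] = []
    current: list[str] = []
    for unit in units:
        if unit == BOUNDARY:
            if current:
                words.append("".join(current))
                current = []
        else:
            current.append(unit)
    if current:
        words.append("".join(current))
    return " ".join(words)


def is_covered(word: str, inventory: set[str]) -> bool:
    """True if every character of the word is in the unit inventory."""
    return all(ch in inventory for ch in word)


# Unit inventory collected from a tiny "training" text.
inventory = set(to_char_units("talo talossa"))

# "talosta" never occurs in the training text, yet all its units do,
# so a character-based model can still represent it.
print(is_covered("talosta", inventory))
print(from_char_units(to_char_units("talo on suuri")))
```

A word-based vocabulary built from the same training text would treat "talosta" as out-of-vocabulary, whereas the character inventory covers it. This only shows the representation side, not the language model or the decoder.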

Bibliographic record

Source
2017 IEEE Automatic Speech Recognition and Understanding Workshop | 2017 | pp. 149-156 | 8 pages
Conference location: Okinawa (JP)
Authors
Peter Smit; Siva Reddy Gangireddy; Seppo Enarvi; Sami Virpioja; Mikko Kurimo
Author affiliations

Department of Signal Processing and Acoustics, Aalto University, Finland (listed for each of the five authors)
Original format: PDF
Language: eng
Keywords
Computational modeling; Vocabulary; Speech recognition; Training; Acoustics; Speech; Context modeling;

Similar literature

1. Korean large vocabulary continuous speech recognition with morpheme-based recognition units [J]. Oh-Wook Kwon, Jun Park. Speech Communication, 2003, No. 3a4.
2. A usage of the syllable unit based on morphological statistics in Korean large vocabulary continuous speech recognition system [J]. Hyok-Chol Ri. International Journal of Speech Technology, 2019, No. 4.
3. Unlimited vocabulary speech recognition with morph language models applied to Finnish [J]. Teemu Hirsimaki, Mathias Creutz, Vesa Siivola, et al. Computer Speech and Language, 2006, No. 4.
4. Character-based units for unlimited vocabulary continuous speech recognition [C]. Peter Smit, Siva Reddy Gangireddy, Seppo Enarvi, et al. IEEE Workshop on Automatic Speech Recognition and Understanding, 2017.
5. An Error Detection and Correction Framework to Improve Large Vocabulary Continuous Speech Recognition [D]. Zhou, Zhengyu. 2009.
6. Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition [O]. Jibin Wu, Emre Yılmaz, Malu Zhang, et al. 2020.
7. Korean large vocabulary continuous speech recognition with morpheme-based recognition units [O]. Oh-wook Kwon, Jun Park. 2003.
