A class-based language model for large-vocabulary speech recognition extracted from part-of-speech statistics


Abstract

A novel approach to class-based language modeling based on part-of-speech statistics is presented. It uses a deterministic word-to-class mapping, which handles words with alternative part-of-speech assignments through the use of ambiguity classes. The predictive power of word-based language models and the generalization capability of class-based language models are combined using both linear interpolation and word-to-class backoff, and both methods are evaluated. Since each word belongs to precisely one ambiguity class, an exact word-to-class backoff model can easily be constructed. Empirical evaluations on large-vocabulary speech-recognition tasks show perplexity improvements and significant reductions in word error rate.
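The ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy lexicon, the corpus, the function names, and the interpolation weight `lam` are all assumptions made for illustration, and the estimates are unsmoothed relative frequencies. Each word maps deterministically to one ambiguity class (the set of all part-of-speech tags it can take), and a word bigram estimate is linearly interpolated with a class bigram estimate. The word-to-class backoff variant, which needs discounting, is not shown.

```python
from collections import Counter

# Hypothetical toy lexicon (assumption): every POS tag each word can take.
LEXICON = {
    "the": {"DET"},
    "a": {"DET"},
    "dog": {"NOUN"},
    "cat": {"NOUN"},
    "walks": {"NOUN", "VERB"},
    "runs": {"NOUN", "VERB"},
}

# Hypothetical toy training corpus (assumption).
TOY_CORPUS = [["the", "dog", "runs"], ["a", "cat", "walks"]]


def ambiguity_class(word):
    """Deterministic mapping: a word's class is its full set of possible tags."""
    return "|".join(sorted(LEXICON[word]))


def train(sentences):
    """Collect unigram, bigram, and history counts for words and classes."""
    names = ("word_uni", "word_bi", "word_hist",
             "class_uni", "class_bi", "class_hist")
    m = {name: Counter() for name in names}
    for sent in sentences:
        classes = [ambiguity_class(w) for w in sent]
        m["word_uni"].update(sent)
        m["class_uni"].update(classes)
        for i in range(1, len(sent)):
            m["word_bi"][(sent[i - 1], sent[i])] += 1
            m["word_hist"][sent[i - 1]] += 1
            m["class_bi"][(classes[i - 1], classes[i])] += 1
            m["class_hist"][classes[i - 1]] += 1
    return m


def _ratio(num, den):
    """Relative frequency, defined as 0.0 when the history was never seen."""
    return num / den if den else 0.0


def word_prob(m, prev, word):
    """Word bigram estimate P(word | prev)."""
    return _ratio(m["word_bi"][(prev, word)], m["word_hist"][prev])


def class_prob(m, prev, word):
    """Class bigram estimate P(c(word) | c(prev)) * P(word | c(word))."""
    c_prev, c = ambiguity_class(prev), ambiguity_class(word)
    transition = _ratio(m["class_bi"][(c_prev, c)], m["class_hist"][c_prev])
    emission = _ratio(m["word_uni"][word], m["class_uni"][c])
    return transition * emission


def interpolated_prob(m, prev, word, lam=0.5):
    """Linear interpolation of the word-based and class-based estimates."""
    return lam * word_prob(m, prev, word) + (1.0 - lam) * class_prob(m, prev, word)


model = train(TOY_CORPUS)
# "dog walks" never occurs in the corpus, so the word bigram gives 0,
# but the class model generalizes from "cat walks" and "dog runs".
print(interpolated_prob(model, "dog", "walks"))
```

Because every word belongs to exactly one class, the class-based estimate sums to one over the vocabulary for any seen class history, which is what makes an exact backoff to the class model straightforward to define.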


Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 1999 | 4 pages
Authors
Christer Samuelsson; Wolfgang Reichl
Original format: PDF
Chinese Library Classification: TN912-53

Similar literature


1. Application of Morphosyntactic and Class-Based Language Models in Automatic Speech Recognition of Polish [J]. Smywinski-Pohl Alexsander, Ziolko Bartosz. International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms. 2016, No. 2
2. Reducing latency for language identification based on large-vocabulary continuous speech recognition [J]. Takuma Okamoto, Atsuo Hiroe, Hisashi Kawai. Acoustical Science and Technology. 2017, No. 1
3. Large-vocabulary speech recognition: A system for the Italian language [J]. IBM Journal of Research and Development. 1988, No. 2
4. A class-based language model for large-vocabulary speech recognition extracted from part-of-speech statistics [C]. Samuelsson, C., Reichl, . 1999
5. Balancing model resolution and generalizability in large-vocabulary continuous speech recognition [D]. Luo, Xiaoqiang. 1999
6. Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition [O]. Edvin Pakoci, Branislav Popović, Darko Pekar. 2019
7. Comparison of Part-of-Speech and Automatically Derived Category-Based Language Models for Speech Recognition [O]. T.R. Niesler, E.W.D. Whittaker, P.C. Woodland. 1998
8. High-Accuracy Large-Vocabulary Speech Recognition Using Mixture Tying and Consistency Modeling [R]. Digalakis, V., Murveit, H. 1994
