首页> 外文会议> >A class-based language model for large-vocabulary speech recognition extracted from part-of-speech statistics

【24h】

A class-based language model for large-vocabulary speech recognition extracted from part-of-speech statistics

机译：从词性统计中提取用于大词汇语音识别的基于类的语言模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A novel approach is presented to class-based language modeling based on part-of-speech statistics. It uses a deterministic word-to-class mapping, which handles words with alternative part-of-speech assignments through the use of ambiguity classes. The predictive power of word-based language models and the generalization capability of class-based language models are combined using both linear interpolation and word-to-class backoff, and both methods are evaluated. Since each word belongs to one precisely ambiguity class, an exact word-to-class backoff model can easily be constructed. Empirical evaluations on large-vocabulary speech-recognition tasks show perplexity improvements and significant reductions in word error-rate.

机译：提出了一种新颖的方法，用于基于词性统计的基于类的语言建模。它使用确定性的词到类映射，该映射通过使用歧义类来处理具有替代词性分配的词。基于单词的语言模型的预测能力和基于类的语言模型的泛化能力通过线性插值和单词到类的退避相结合，并对这两种方法进行了评估。由于每个词都属于一个精确的歧义类别，因此可以轻松构建一个精确的词对类退避模型。对大词汇量语音识别任务的实证评估表明，困惑度得到改善，单词错误率显着降低。

著录项

来源
《》|1999年|P.537-540|共4页
会议地点
作者
Samuelsson; C.; Reichl; W.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Application of Morphosyntactic and Class-Based Language Models in Automatic Speech Recognition of Polish [J] . Smywinski-Pohl Alexsander, Ziolko Bartosz International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms . 2016,第2期

机译：形态语法和基于类的语言模型在波兰语自动语音识别中的应用
2. Reducing latency for language identification based on large-vocabulary continuous speech recognition [J] . Takuma Okamoto, Atsuo Hiroe, Hisashi Kawai Acoustical science and technology . 2017,第1期

机译：减少基于大词汇量连续语音识别的语言识别延迟
3. Large-vocabulary speech recognition: A system for the Italian language [J] . IBM Journal of Research and Development . 1988,第2期

机译：大词汇语音识别：意大利语系统
4. A class-based language model for large-vocabulary speech recognition extracted from part-of-speech statistics [C] . Christer Samuelsson, Wolfgang Reichl IEEE International Conference on Acoustics, Speech and Signal Processing . 1999

机译：基于类语言模型的语音语音识别从词性统计中提取
5. Balancing model resolution and generalizability in large-vocabulary continuous speech recognition. [D] . Luo, Xiaoqiang. 1999

机译：在大词汇量连续语音识别中平衡模型的分辨率和可推广性。
6. Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition [O] . Edvin Pakoci, Branislav Popović, Darko Pekar 2019

机译：在塞尔维亚大型词汇语音识别的语言建模中使用形态学数据
7. Comparison Of Part-Of-Speech And Automatically Derived Category-Based Language Models For Speech Recognition [O] . T.R. Niesler, E. W. D. Whittaker, P.C. Woodland 1998

机译：语音识别的词性和自动派生基于类别的语言模型的比较
8. High-Accuracy Large-Vocabulary Speech Recognition Using Mixture Tying and Consistency Modeling. [R] . Digalakis, V., Murveit, H. 1994

机译：基于混合搭配和一致性建模的高精度大词汇量语音识别。

A class-based language model for large-vocabulary speech recognition extracted from part-of-speech statistics

摘要

著录项

相似文献

相关主题

期刊订阅