Investigation of Maximum Entropy Hybrid Language Models for Open Vocabulary German and Polish LVCSR

机译：开放词汇德语和波兰语LVCSR的最大熵混合语言模型研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

For languages like German and Polish, higher numbers of word inflections lead to high out-of-vocabulary (OOV) rates and high language model (LM) perplexities. Thus, one of the main challenges in large vocabulary continuous speech recognition (LVCSR) is recognizing an open vocabulary. In this paper, we investigate the use of mixed type of sub-word units in the same recognition lexicon. Namely, morphemic or syllabic units combined with pronunciations called graphones, normal graphemic morphemes or syllables, along with full-words. In addition, we investigate the suitability of hybrid mixed-unit N-grams as features for Maximum Entropy LM along with adaptation. We achieve significant improvements in recognizing OOVs and word error rate reductions for German and Polish LVCSR compared to the conventional full-word approach and state-of-the-art N-gram mixed type hybrid LM.

机译：对于像德语和波兰语这样的语言，更多的词变形会导致较高的词汇率（OOV）和较高的语言模型（LM）困惑。因此，大词汇量连续语音识别（LVCSR）的主要挑战之一是识别开放词汇。在本文中，我们研究了在同一识别词典中混合类型的子单词单元的使用。即，语素或音节单位与被称为graphones，正常graphemic morphemes或音节的发音以及全词结合在一起。此外，我们调查了混合混合单元N元语法作为最大熵LM的特征以及适应性的适用性。与传统的全字词方法和最新的N-gram混合型混合LM相比，我们在识别德语和波兰语LVCSR的OOV和减少字错误率方面取得了显着改进。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|1070-1073|共4页
会议地点
作者
M. Ali Basha Shaik; Amr El-Desoky Mousa; Ralf Schlueter; Hermann Ney;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
open vocabulary; maximum entropy;

机译：开放的词汇最大熵;

相似文献

外文文献
中文文献
专利

1. Constructing Maximum Entropy Language Models For Movie Review Subjectivity Analysis [J] . Bo Chen, Hui He, Jun Guo Journal of experimental algorithmics . 2008,第2期

机译：电影评论主观性分析的最大熵语言模型的构建
2. Constructing Maximum Entropy Language Models for Movie Review Subjectivity Analysis [J] . Bo Chen, Hui He, Jun Guo 计算机科学技术学报（英文版） . 2008,第002期

机译：电影评论主观性分析的最大熵语言模型的构建
3. Combining Statistical Language Models via the Latent Maximum Entropy Principle [J] . SHAOJUN WANG, DALE SCHUURMANS, FUCHUN PENG, Machine Learning . 2005,第1a3期

机译：通过潜在最大熵原理组合统计语言模型
4. Investigation of Maximum Entropy Hybrid Language Models for Open Vocabulary German and Polish LVCSR [C] . M. Ali. Basha Shaik, Amr El-Desoky Mousa, Ralf Schlüter, INTERSPEECH 2012 . 2012

机译：开放词汇德语与波兰LVCSR的最大熵混合语言模型的研究
5. Maximum entropy language modeling with non-local dependencies. [D] . Wu, Jun. 2003

机译：具有非本地依赖性的最大熵语言建模。
6. Strategy Use in Second Language Vocabulary Learning and Its Relationships With the Breadth and Depth of Vocabulary Knowledge: A Structural Equation Modeling Study [O] . Na Fan 2020

机译：策略在第二语言词汇学习中的应用及其与词汇知识广度和深度的关系：结构方程模型研究
7. Investigation of Maximum Entropy Hybrid Language Models for Open Vocabulary German and Polish LVCSR [O] . Basha Shaik Mahaboob Ali, El-Desoky Mousa Amr, Schlüter Ralf, 2012

机译：开放词汇德语和波兰语LVCSR的最大熵混合语言模型研究
8. Adaptive Statistical Language Modeling; A Maximum Entropy Approach. [R] . Rosenfeld, R. 1994

机译：自适应统计语言建模;最大熵方法。

Investigation of Maximum Entropy Hybrid Language Models for Open Vocabulary German and Polish LVCSR

摘要

著录项

相似文献

相关主题

期刊订阅