
Class n-Gram Models for Very Large Vocabulary Speech Recognition of Finnish and Estonian



Abstract

We study class n-gram models for very large vocabulary speech recognition of Finnish and Estonian. The models are trained with vocabulary sizes of several million words using automatically derived classes. To evaluate the models on Finnish and Estonian broadcast news speech recognition tasks, we modify Aalto University's LVCSR decoder to operate with the class n-grams and very large vocabularies. Linear interpolation of a standard n-gram model and a class n-gram model provides relative perplexity improvements of 21.3% for Finnish and 12.8% for Estonian over the n-gram model. The relative improvements in word error rates are 5.5% for Finnish and 7.4% for Estonian. We also compare our word-based models to a state-of-the-art unlimited vocabulary recognizer utilizing subword n-gram models, and show that the very large vocabulary word-based models can perform equally well or better.
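The interpolation described in the abstract combines a word n-gram probability with a class n-gram probability, where the class model factors P(word | history) into a class transition probability and a class membership probability. The following is a minimal sketch of that combination, not the authors' implementation; the toy probability tables, the word-to-class mapping, and the interpolation weight are illustrative assumptions.

```python
# Minimal sketch of linear interpolation between a word n-gram model and a
# class n-gram model. All numbers and the lambda weight are toy assumptions,
# not values from the paper.

# Word bigram probabilities P(w | w_prev), toy values.
word_ngram = {
    ("<s>", "tere"): 0.10,
    ("tere", "tulemast"): 0.30,
}

# Hard word-to-class assignment c(w) from automatic clustering (assumed).
word2class = {"<s>": "C0", "tere": "C1", "tulemast": "C2"}

# Class bigram probabilities P(c | c_prev) and class membership
# probabilities P(w | c(w)), toy values.
class_ngram = {("C0", "C1"): 0.40, ("C1", "C2"): 0.50}
class_membership = {"tere": 0.20, "tulemast": 0.25}

LAMBDA = 0.6  # interpolation weight, normally tuned on held-out data


def interpolated_prob(prev_word: str, word: str) -> float:
    """P(word | prev_word) as a linear interpolation of the two models."""
    p_word = word_ngram.get((prev_word, word), 0.0)
    # Class model: P(c(word) | c(prev_word)) * P(word | c(word)).
    p_class = (
        class_ngram.get((word2class[prev_word], word2class[word]), 0.0)
        * class_membership.get(word, 0.0)
    )
    return LAMBDA * p_word + (1.0 - LAMBDA) * p_class


if __name__ == "__main__":
    # 0.6 * 0.30 + 0.4 * (0.50 * 0.25) = 0.23
    print(interpolated_prob("tere", "tulemast"))
```

In practice the weight would be optimized on a development set, and the decoder would query both models per hypothesis rather than looking up a small static table.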
