電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication

Looking at alternatives within the framework of n-gram based language modeling for spontaneous speech recognition

Abstract

This paper presents different methods that use a weighted mixture of word and word-class language models to perform language model adaptation. A general language model is built from the whole training corpus; then several sets of clusters are created according to a word co-occurrence measure, and finally word models as well as word-class models are built from each cluster. The general language model is then combined with one or several other models chosen according to a minimum-perplexity criterion. Results show absolute reductions in word error rate of 1.40% and 0.49% on average for two different test sets of the "Corpus of Spontaneous Japanese."
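
The adaptation scheme described in the abstract (a general model mixed with cluster-specific models chosen by a minimum-perplexity criterion) can be sketched roughly as follows. This is a minimal illustration only: it assumes bigram models stored as plain {(w1, w2): probability} dictionaries, a single fixed interpolation weight, and selection of a single cluster model; the function names and parameters are assumptions for illustration, not the paper's actual implementation.

```python
import math

def perplexity(model, sentence, floor=1e-7):
    """Perplexity of a word sequence under a bigram model given as a
    {(w1, w2): prob} dict; unseen bigrams receive a small floor probability."""
    log_prob = sum(math.log(model.get(bigram, floor))
                   for bigram in zip(sentence, sentence[1:]))
    n = max(len(sentence) - 1, 1)
    return math.exp(-log_prob / n)

def interpolate(general_lm, cluster_lm, lam=0.7):
    """Weighted linear mixture of two bigram models:
    p = lam * p_general + (1 - lam) * p_cluster."""
    bigrams = set(general_lm) | set(cluster_lm)
    return {bg: lam * general_lm.get(bg, 0.0) + (1.0 - lam) * cluster_lm.get(bg, 0.0)
            for bg in bigrams}

def adapt(general_lm, cluster_lms, adaptation_text, lam=0.7):
    """Pick the cluster model with minimum perplexity on the adaptation text,
    then combine it with the general model as a weighted mixture."""
    best = min(cluster_lms, key=lambda m: perplexity(m, adaptation_text))
    return interpolate(general_lm, best, lam)
```

In the paper's setting the mixture may also include several cluster models and separate word-class models; the single-component, fixed-weight version above is only meant to show where the minimum-perplexity selection and the weighted combination enter.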