N-gram Language Model Based on Multi-Word Expressions in Web Documents for Speech Recognition and Closed-Captioning

Abstract

Automatic speech recognition is commonly used to align closed-caption text with video data, so accurate closed-captioning depends on high recognition accuracy. This paper proposes a method for constructing an N-gram language model based on multi-word expressions (MWEs) drawn from web retrieval results in order to improve speech recognition performance. Two experiments are conducted: a web retrieval experiment that examines the distribution of web counts for MWEs, and a speech recognition experiment that investigates the effectiveness of MWEs. The experimental results show that the proposed method can improve both recognition performance and closed-captioning accuracy.
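
The abstract does not spell out how MWEs enter the language model, so the following Python sketch is only one plausible reading, not the authors' implementation. It assumes that candidate expressions are kept when a web hit count passes a threshold, that each kept MWE is merged into a single token, and that a bigram model with add-one smoothing is then estimated. The function names, the threshold, the smoothing choice, and the toy counts are all hypothetical.

```python
from collections import Counter

def select_mwes(web_counts, min_count):
    """Keep candidate expressions whose (hypothetical) web hit count reaches min_count."""
    return {tuple(expr.split()) for expr, n in web_counts.items() if n >= min_count}

def merge_mwes(tokens, mwes):
    """Greedily replace the longest matching MWE at each position with one joined token."""
    max_len = max((len(m) for m in mwes), default=1)
    out, i = [], 0
    while i < len(tokens):
        for n in range(min(max_len, len(tokens) - i), 1, -1):
            if tuple(tokens[i:i + n]) in mwes:
                out.append("_".join(tokens[i:i + n]))
                i += n
                break
        else:
            # No MWE starts here: keep the single word.
            out.append(tokens[i])
            i += 1
    return out

def train_bigram(sentences):
    """Count histories and bigrams over sentences padded with <s> and </s>."""
    history, bigrams, vocab = Counter(), Counter(), set()
    for sent in sentences:
        padded = ["<s>"] + sent + ["</s>"]
        vocab.update(padded)
        for prev, cur in zip(padded, padded[1:]):
            history[prev] += 1
            bigrams[(prev, cur)] += 1
    return history, bigrams, vocab

def bigram_prob(prev, cur, history, bigrams, vocab):
    """Add-one smoothed estimate of P(cur | prev)."""
    return (bigrams[(prev, cur)] + 1) / (history[prev] + len(vocab))

# Toy, made-up web counts and sentences, for illustration only.
web_counts = {"closed caption": 120000, "speech recognition": 950000, "caption speech": 40}
mwes = select_mwes(web_counts, min_count=1000)
corpus = [
    "speech recognition aligns the closed caption text".split(),
    "the closed caption follows the speech".split(),
]
merged = [merge_mwes(s, mwes) for s in corpus]
history, bigrams, vocab = train_bigram(merged)
print(merged[0])
print(bigram_prob("the", "closed_caption", history, bigrams, vocab))
```

Merging an MWE into one token lets a low-order N-gram cover a longer word span, which is one common motivation for MWE-based vocabularies; whether the paper uses this exact mechanism is not stated in the abstract.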

Bibliographic record

Source: International Conference on Asian Language Processing, 2012, pp. 225-228 (4 pages)
Authors: Takahashi Shinya; Morimoto Tsuyoshi
Original format: PDF
Chinese Library Classification: programming languages, algorithmic languages
Keywords: N-gram; closed-captioning; language model; speech recognition; web documents

