Speech Recognition with a Seamlessly Updated Language Model for Real-Time Closed-Captioning

Abstract

For online applications such as real-time closed-captioning, it is desirable to update the language model of a speech recognizer consistently and seamlessly, without stopping recognition. This paper proposes a novel speech recognition system that allows the model to be updated at any time, even while the system is running. The system runs a second decoder with the latest model in parallel. An additional job process controls which decoder has priority and switches between them at a non-speech portion: it sends acoustic features only to the active target decoder with the latest model and sends the recognized words to the back-end manual error correction for closed-captioning. The system thus updates the model seamlessly and ensures uninterrupted speech recognition with the latest model at any time. In speech recognition and captioning experiments on Japanese broadcast news programs, our new practical real-time closed-captioning system with the proposed language model update mechanism reduced word errors by two thirds.
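
The switching idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the class names `Decoder` and `DecoderSwitcher`, the method names, the string "frames", and the model version labels are all hypothetical, and a real system would run the two decoders as parallel processes on actual acoustic features. The sketch only shows the control flow: a second decoder with the newer model is prepared while the first keeps running, and the job that routes features changes its target only at a non-speech frame.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Decoder:
    """Toy stand-in for a decoder bound to one language model (hypothetical)."""
    model_version: str
    received: List[str] = field(default_factory=list)

    def decode(self, frame: str) -> str:
        # A real decoder would consume acoustic features and emit words;
        # here we just tag the frame with the model version that handled it.
        self.received.append(frame)
        return f"{frame}@{self.model_version}"


class DecoderSwitcher:
    """Routes frames to the active decoder; swaps decoders only in non-speech."""

    def __init__(self, initial: Decoder) -> None:
        self.active = initial
        self.standby: Optional[Decoder] = None

    def load_update(self, model_version: str) -> None:
        # Prepare a second decoder with the latest model while the
        # active decoder keeps serving the current utterance.
        self.standby = Decoder(model_version)

    def feed(self, frame: str, is_speech: bool) -> Optional[str]:
        if not is_speech:
            # Non-speech portion: safe point to hand over to the new decoder.
            if self.standby is not None:
                self.active, self.standby = self.standby, None
            return None
        # Speech frames go only to the currently active decoder.
        return self.active.decode(frame)


def demo() -> List[str]:
    switcher = DecoderSwitcher(Decoder("lm-v1"))
    stream = [("a", True), ("b", True), ("sil", False), ("c", True)]
    words = []
    for i, (frame, is_speech) in enumerate(stream):
        if i == 1:
            switcher.load_update("lm-v2")  # update arrives mid-utterance
        word = switcher.feed(frame, is_speech)
        if word is not None:
            words.append(word)  # would go to manual error correction
    return words
```

In this sketch, an update that arrives in the middle of an utterance does not interrupt it: the remaining speech frames are still decoded with the old model, and the new model takes over after the next pause.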

Bibliographic details

Source
Annual conference of the International Speech Communication Association; INTERSPEECH 2010 | 2011 | pp. 262-265 | 4 pages
Authors
Toru Imai; Shinichi Homma; Akio Kobayashi; Takahiro Oku; Shoei Sato
Original format
PDF
Chinese Library Classification
Communications
Keywords
speech recognition; closed-captioning; language model; model updates

Similar documents

1. Syllable language models for Mandarin speech recognition: Exploiting character language models [J]. Liu X., Hieronymus J.L., Gales M.J.F. The Journal of the Acoustical Society of America. 2013, No. 1
2. Comparison of Performance of Enhanced Morpheme-based Language Model with Different Word-based Language Models for Improving the Performance of Tamil Speech Recognition System [J]. S. Saraswathi, T.V. Geetha. ACM Transactions on Asian Language Information Processing. 2007, No. 3
3. Modeling under-resourced languages for speech recognition [J]. Kurimo Mikko, Enarvi Seppo, Tilk Ottokar. Language Resources and Evaluation. 2017, No. 4
4. Speech Recognition with a Seamlessly Updated Language Model for Real-Time Closed-Captioning [C]. Toru Imai, Shinichi Homma, Akio Kobayashi. Annual conference of the International Speech Communication Association. 2010
5. Arabic language modeling with stem-derived morphemes for automatic speech recognition [D]. Heintz, Ilana. 2010
6. Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition [O]. Edvin Pakoci, Branislav Popović, Darko Pekar. 2019
7. Real-Time One-Pass Decoding with Recurrent Neural Network Language Model for Speech Recognition [O]. Takaaki Hori, Yotaro Kubo, Atsushi Nakamura. 2014
