首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Speech Recognition with a Seamlessly Updated Language Model for Real-Time Closed-Captioning
【24h】

Speech Recognition with a Seamlessly Updated Language Model for Real-Time Closed-Captioning

机译:具有无缝更新语言模型的语音识别,用于实时隐藏字幕

获取原文

摘要

It is desirable to consistently and seamlessly update a language model of speech recognition without stopping it for online applications such as real-time closed-captioning. This paper proposes a novel speech recognition system that enables the model to be updated at any time even while it is running. It can run the second decoder with the latest model in parallel, and their priority that must be accessed is controlled at a non-speech portion by an additional job process, which sends acoustic features only to an active target decoder with the latest model and sends recognized words to the backend manual error correction for closed-captioning. The system seamlessly updates the model and ensures endless speech recognition with the latest model at any time. Our new practical real-time closed-captioning system reduced word errors by two thirds with the proposed language model update mechanism in the speech recognition and captioning experiments for Japanese broadcast news programs.
机译:理想的是一致且无缝地更新语音识别的语言模型而不停止在线应用程序(例如实时隐藏字幕)的语言模型。本文提出了一种新颖的语音识别系统,该系统即使在模型运行时也可以随时对其进行更新。它可以并行运行具有最新模型的第二个解码器,并且必须通过额外的作业过程在非语音部分控制必须访问的优先级,该过程仅将声学特征发送给具有最新模型的活动目标解码器,并发送识别的字词,用于后端手动错误更正,用于隐藏式字幕。该系统无缝更新模型,并确保随时使用最新模型进行无尽的语音识别。在日本广播新闻节目的语音识别和字幕实验中,我们提出的语言模型更新机制为我们的新型实用实时字幕系统提供了建议的语言模型更新机制,将单词错误减少了三分之二。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号