首页> 外文会议>INTERSPEECH 2012 >On-the-fly Topic Adaptation for YouTube Video Transcription
【24h】

On-the-fly Topic Adaptation for YouTube Video Transcription

机译:适用于YouTube视频转录的动态主题适应

获取原文

摘要

Automatic closed-captioning of video is a useful application of speech recognition technology but poses numerous challenges when applied to open-domain user-uploaded videos such as those on YouTube. In this work, we explore a strategy to improve decoding accuracy for video transcription by decoding each video with a language model (LM) adapted specifically to the topics that the video covers. Taxonomic topic classifiers are used to determine the topic content of videos and to build a large set of topic-specific LMs from web documents. We consider strategies for selecting and interpolating LMs in both supervised and unsupervised scenarios in a two-pass lattice rescoring framework. Experiments on a YouTube video corpus show a 3.6 absolute reduction in WER over generic single-pass transcriptions as well as a statistically significant 0.8 absolute improvement over rescoring with a very large non-adapted LM built from all the documents.
机译:视频的自动关闭标题是语音识别技术的有用应用,但在应用于开放域用户上传的视频(例如YouTube)时造成许多挑战。在这项工作中,我们通过用语言模型(LM)解码专门的语言模型(LM)来探讨提高视频转录的解码准确性的策略。分类学主题分类器用于确定视频的主题内容,并从Web文档构建大量的特定主题LMS。我们考虑在双通晶格救援框架中选择和翻译LMS的策略。 youtube视频语料库上的实验显示了一般性单通转录的WER 3.6绝对减少,以及通过从所有文档构建的非常大的非适应LM,统计上显着的0.8绝对改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号