International Conference on Speech and Computer

Automatic Close Captioning for Live Hungarian Television Broadcast Speech: A Fast and Resource-Efficient Approach

Abstract

In this paper, the application of LVCSR (Large Vocabulary Continuous Speech Recognition) technology is investigated for real-time, resource-limited broadcast close captioning. The work focuses on transcribing live broadcast conversation speech to make such programs accessible to deaf viewers. Due to computational limitations, the real-time factor (RTF) and memory requirements are kept low during decoding with various models tailored for Hungarian broadcast speech recognition. Two decoders are compared on the direct transcription task of broadcast conversation recordings, and setups employing re-speakers are also tested. Moreover, the models are also evaluated on a broadcast news transcription task, and different language models (LMs) are tested in order to demonstrate the performance of our systems in settings where low memory consumption is less crucial.
