Automatic Close Captioning for Live Hungarian Television Broadcast Speech: A Fast and Resource-Efficient Approach

Abstract

In this paper, the application of LVCSR (Large Vocabulary Continuous Speech Recognition) technology is investigated for real-time, resource-limited broadcast close captioning. The work focuses on transcribing live broadcast conversation speech to make such programs accessible to deaf viewers. Due to computational limitations, the real-time factor (RTF) and memory requirements are kept low during decoding with various models tailored for Hungarian broadcast speech recognition. Two decoders are compared on the direct transcription task of broadcast conversation recordings, and setups employing re-speakers are also tested. The models are also evaluated on a broadcast news transcription task, and different language models (LMs) are tested to demonstrate the performance of our systems in settings where low memory consumption is less crucial.

Bibliographic details

Source
International Conference on Speech and Computer | 2015 | pp. 105-112 | 8 pages
Authors
Adam Varga; Balazs Tarjan; Zoltan Tobler; Gyoergy Szaszak; Tibor Fegyo; Csaba Bordas; Peter Mihajlik
Original format: PDF
Keywords
Speech recognition; LVCSR; Broadcast news; Broadcast conversation; GMM; DNN; Hungarian; Kaldi; Limited resources

Similar literature

1. Online Speech Detection and Dual-Gender Speech Recognition for Captioning Broadcast News [J]. NHK技研 R&D. 2009, No. 114
2. Online Speech Detection And Dual-gender Speech Recognition For Captioning Broadcast News [J]. Toru IMAI, Shoei SATO, Shinichi HOMMA, IEICE Transactions on Information and Systems. 2007, No. 8
3. An Automatic Video Reinforcing System for TV Programs using Semantic Metadata from Closed Captions [J]. Yuanyuan Wang, Daisuke Kitayama, Yukiko Kawai, International journal of multimedia data engineering & management. 2016, No. 1
4. Automatic Close Captioning for Live Hungarian Television Broadcast Speech: A Fast and Resource-Efficient Approach [C]. Adam Varga, Balazs Tarjan, Zoltan Tobler, Speech and Computer International Conference. 2015
5. Still 'Live at the Scene': An Exploration of Timely Television News Broadcasts Repurposed as Online Content. [D]. Ware, Jennifer Marie. 2012
6. TV: Brain surgery gets its first live UK broadcast [O]. Kristina Fister 2004
7. ‘We go green in Beijing’: situating live television, urban motor sport and environmental sustainability by means of a framing analysis of TV broadcasts of Formula E [O]. Timothy Robeers 2019
