【24h】

Speech Retrieval for TV News Programs by Fusing the Audio and Video Information

机译:通过融合音频和视频信息检索电视新闻节目的语音

获取原文
获取原文并翻译 | 示例

摘要

A typical news story contains a brief report by the anchor person(s) in the studio, as well as news footage in the field. Investigation shows that our recognizer performs better when indexing audio from the studio than that from the field. In order to automatically extract the "reliable" audio segments for speech retrieval, we attempt to detect studio-to-field transitions by means of video parsing. Our research is based on 146 news stories collected from Hong Kong TVB Jade station. Retrieval using the entire audio track gave (average inverse rank) AIR=0.759. while,with the incorporation of video parsing, we performed retrieval based only on the studio recordings, which produced AIR=0.765.
机译:一个典型的新闻故事包含演播室中主持人的简短报告以及现场的新闻镜头。调查显示,在对录音室的音频进行索引时,我们的识别器比现场的识别器性能更好。为了自动提取“可靠”的音频片段以进行语音检索,我们尝试通过视频解析来检测演播室到现场的过渡。我们的研究基于从香港无线电视翡翠台搜集的146个新闻报道。使用整个音频轨道进行检索得到的AIR = 0.759(平均倒数排名)。同时,由于结合了视频解析,我们仅根据演播室录音进行检索,从而产生AIR = 0.765。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号