Journal: Systems and Computers in Japan

Highlight Detection and Indexing in Broadcast Sports Video by Collaborative Processing of Text, Audio, and Image



Abstract

In this paper we propose a method for detecting and indexing highlights in broadcast sports video through the collaborative processing of text, audio, and image. The method first analyzes the appearance pattern of words in the closed-caption text stream to detect candidate highlight intervals. Next, it checks each candidate interval against its audio level and rejects those that appear to be erroneous. Finally, it segments the remaining highlight intervals into shots and indexes the shots by identifying the highlight shots from audio levels and dominant color information. Applied to a real football broadcast, the method detected highlight intervals with a recall of 77% and a precision of 84%. Moreover, for the intervals in which highlights were correctly detected, shot indexing was accurate in 75% of cases when only the first candidate was considered, and in 97% of cases when up to the second candidate was considered. We verified experimentally that step-wise analysis of the text, audio, and image enables efficient processing.
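The step-wise structure described above can be illustrated with a minimal sketch. It is not the authors' implementation: plain keyword matching stands in for the paper's word appearance-pattern analysis, a mean-level threshold stands in for the audio check, and the shot score (loudness multiplied by the fraction of dominant-color pixels) is an assumed heuristic. All names (`Interval`, `candidates_from_captions`, `reject_quiet`, `rank_shots`) and parameters (`window`, `threshold`, `color_ratio`) are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Interval:
    start: float  # seconds from the start of the broadcast
    end: float


def candidates_from_captions(captions, keywords, window=10.0):
    """Stage 1 (text): build candidate intervals around keyword hits.

    captions is a list of (time_sec, text) pairs. Keyword matching is an
    assumed stand-in for the paper's word appearance-pattern analysis.
    """
    hits = sorted(t for t, text in captions
                  if any(k in text.lower() for k in keywords))
    intervals = []
    for t in hits:
        if intervals and t - window <= intervals[-1].end:
            # Overlapping windows are merged into a single candidate.
            intervals[-1].end = t + window
        else:
            intervals.append(Interval(max(0.0, t - window), t + window))
    return intervals


def mean_level(audio_levels, interval):
    """Mean audio level over an interval; audio_levels has one value per second."""
    segment = audio_levels[int(interval.start):int(interval.end)]
    return sum(segment) / len(segment) if segment else 0.0


def reject_quiet(intervals, audio_levels, threshold):
    """Stage 2 (audio): drop candidates whose mean level is below threshold."""
    return [iv for iv in intervals
            if mean_level(audio_levels, iv) >= threshold]


def rank_shots(shots, audio_levels, color_ratio):
    """Stage 3 (audio + image): order shot indices by an assumed score.

    color_ratio[i] is the fraction of pixels in shot i that carry the
    dominant color. The product score is hypothetical, not from the paper.
    """
    scores = [mean_level(audio_levels, shot) * color_ratio[i]
              for i, shot in enumerate(shots)]
    return sorted(range(len(shots)), key=lambda i: scores[i], reverse=True)
```

Each stage only processes what the previous stage let through, so the cheap text analysis runs over the whole broadcast while the image analysis touches only the shots inside surviving intervals. The first two entries of the ranking returned by `rank_shots` correspond to the first and second candidates mentioned in the abstract.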
