Conference: Visual Communications and Image Processing 2005 pt.2

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain


Abstract

The text in a video frame can help us understand the semantics of video content directly. Although many approaches can automatically detect and localize text in a video, most of them use the original pixels of an image to find the text regions. In this paper, we present an approach to automatically localize captions in MPEG compressed videos. Caption regions are segmented from the background by using their distinguishing texture characteristics. Unlike previously published approaches, which either fully decompress the video sequence before extracting the caption regions or extract text regions only in Intra (I-) frames, our approach detects and localizes caption regions directly in the DCT compressed domain. Therefore, only a very small amount of decoding is required. Experiments show that a good caption detection rate can be obtained: the average recalls of Intra- and Inter-frame detection are 97.77% and 97.84%, respectively.
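The abstract does not say which DCT coefficients or thresholds the authors use, so the following is only a minimal sketch of the general idea of texture-based block classification in the DCT domain, not the paper's method. It assumes that a block's texture can be summarized by the sum of its absolute AC coefficients and that a fixed threshold separates caption-like blocks from smooth background. The function names, the energy measure, and the threshold value of 200.0 are all hypothetical. In a real MPEG stream the coefficients would be read from the partially decoded bitstream; here they are recomputed from pixels only so that the example is runnable.

```python
import math

N = 8  # MPEG codes intra-frame data as 8x8 DCT blocks


def dct2_block(block):
    """Orthonormal 2-D DCT-II of an 8x8 block (list of 8 rows of 8 numbers).

    Stand-in for coefficients that a real system would take from the bitstream.
    """
    def alpha(k):
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

    coeffs = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            s = 0.0
            for x in range(N):
                for y in range(N):
                    s += (block[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            coeffs[u][v] = alpha(u) * alpha(v) * s
    return coeffs


def ac_energy(coeffs):
    """Sum of absolute AC coefficients, i.e. everything except the DC term."""
    total = sum(abs(c) for row in coeffs for c in row)
    return total - abs(coeffs[0][0])


def is_text_candidate(coeffs, threshold=200.0):
    """Flag a block as caption-like when its AC energy exceeds the threshold.

    The threshold is an arbitrary illustrative value, not taken from the paper.
    """
    return ac_energy(coeffs) > threshold


# A flat block (smooth background) and a striped block (stroke-like texture).
flat = [[128] * N for _ in range(N)]
stripes = [[255 if y % 2 == 0 else 0 for y in range(N)] for _ in range(N)]

print(is_text_candidate(dct2_block(flat)))
print(is_text_candidate(dct2_block(stripes)))
```

A flat block carries all of its energy in the DC coefficient, while sharp character strokes spread energy into the AC coefficients, which is why a simple AC-energy test can act as a first-pass texture cue without reconstructing pixels.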
