Automatic Video Caption Detection and Extraction in the DCT Compressed Domain


Abstract

The text in a video frame can help us understand the semantics of video content directly. Although many approaches can automatically detect and localize text in a video, most of them use the original pixels of an image to find the text regions. In this paper, we present an approach to automatically localize captions in MPEG compressed videos. Caption regions are segmented from the background by using their distinguishing texture characteristics. Unlike previously published methods, which either fully decompress the video sequence before extracting the caption regions or extract text regions only in Intra- (I-) frames, our approach detects and localizes caption regions directly in the DCT compressed domain. Therefore, only a very small amount of decoding is required. Experiments show that a good caption detection rate can be obtained: the average recalls of Intra- and Inter-frame detection are 97.77% and 97.84%, respectively.
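The idea of separating caption blocks from background by texture in the DCT domain can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not specify the features or thresholds used. The sketch assumes that texture is measured as the sum of absolute AC coefficients of an 8x8 block, and the names `dct2`, `ac_energy`, `is_text_candidate` and the threshold value of 200 are hypothetical. A real compressed-domain detector would read the coefficients from the MPEG bitstream; here they are computed from pixels only so that the example is self-contained.

```python
import math

N = 8  # block size used by MPEG for the DCT


def dct2(block):
    """Orthonormal 2-D DCT-II of an N x N block given as a list of lists."""
    def alpha(k):
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            s = 0.0
            for x in range(N):
                for y in range(N):
                    s += (block[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            out[u][v] = alpha(u) * alpha(v) * s
    return out


def ac_energy(coeffs):
    """Sum of absolute AC coefficients (assumed texture measure)."""
    total = sum(abs(c) for row in coeffs for c in row)
    return total - abs(coeffs[0][0])  # exclude the DC term


def is_text_candidate(coeffs, threshold=200.0):
    """Flag a block as a caption candidate; the threshold is an arbitrary assumption."""
    return ac_energy(coeffs) > threshold


# A flat block (background-like) and a block with sharp vertical strokes.
flat = [[128] * N for _ in range(N)]
stripes = [[255 if (y // 2) % 2 == 0 else 0 for y in range(N)] for _ in range(N)]

print(is_text_candidate(dct2(flat)))
print(is_text_candidate(dct2(stripes)))
```

The flat block carries all of its energy in the DC coefficient, while the striped block has large AC coefficients, which is the kind of contrast a texture-based caption detector can exploit.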

Bibliographic details

Source
Visual Communications and Image Processing 2005, pt. 2 | 2005 | pp. 895-907 | 13 pages
Conference location: Beijing (CN)
Authors
Chin-Fu Tsao; Yu-Hao Chen; Jin-Hau Kuo; Chia-wei Lin; Ja-Ling Wu;
Author affiliation

Communication and Multimedia Laboratory, Graduate Institute of Networking and Multimedia, National Taiwan University, Taipei, Taiwan

Original format: PDF
Language: English
CLC classification: Image communication, multimedia communication
Keywords
caption; compressed domain;

Similar literature

1. An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain [J]. Jongho Nang, Seungwook Hong, Ohyeong Kwon. IEICE Transactions on Communications. 2001, No. 8
2. Automatic video activity detection using compressed domain motion trajectories for H.264 videos [J]. Haowei Liu, Ming-Ting Sun, Ruei-Cheng Wu. Journal of Visual Communication & Image Representation. 2011, No. 5
3. Automatic caption localization in compressed video [J]. Yu Zhong, Hongjiang Zhang. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2000, No. 4
4. Caption Text Extraction Using DCT Feature in MPEG Compressed Video [C]. Jiangbo Xu, Xiuhua Jiang, Yuxia Wang. WRI World Congress on Computer Science and Information Engineering. 2009
5. Computationally scalable spatial resizing of DCT domain compressed images and video [D]. Salazar, Carlos LeRoy. 2007
6. Evaluation of automatic video captioning using direct assessment [O]. Yvette Graham, George Awad, Alan Smeaton. 2012
7. Compressed domain video indexing techniques using DCT and motion vector information in MPEG video [O]. Vikrant Kobla, David Doermann, King-Ip (David) Lin. 1997
8. Qualitive Detection of Independently Moving Targets in MPEG Video Within the Compressed Domain [R]. Zhang, Z. 2004
