Structuring Lecture Videos by Automatic Projection Screen Localization and Analysis

Li Kai; Wang Jue; Wang Haoqian; Dai Qionghai

首页> 外文期刊>Pattern Analysis and Machine Intelligence, IEEE Transactions on >Structuring Lecture Videos by Automatic Projection Screen Localization and Analysis

【24h】

Structuring Lecture Videos by Automatic Projection Screen Localization and Analysis

机译：通过自动投影屏幕本地化和分析来构建演讲视频

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We present a fully automatic system for extracting the semantic structure of a typical academic presentation video, which captures the whole presentation stage with abundant camera motions such as panning, tilting, and zooming. Our system automatically detects and tracks both the projection screen and the presenter whenever they are visible in the video. By analyzing the image content of the tracked screen region, our system is able to detect slide progressions and extract a high-quality, non-occluded, geometrically-compensated image for each slide, resulting in a list of representative images that reconstruct the main presentation structure. Afterwards, our system recognizes text content and extracts keywords from the slides, which can be used for keyword-based video retrieval and browsing. Experimental results show that our system is able to generate more stable and accurate screen localization results than commonly-used object tracking methods. Our system also extracts more accurate presentation structures than general video summarization methods, for this specific type of video.

机译：我们提供了一种用于提取典型学术演示视频语义结构的全自动系统，该系统可通过诸如平移，倾斜和缩放等丰富的摄像机动作来捕获整个演示阶段。只要在视频中可见，我们的系统就会自动检测并跟踪投影屏幕和演示者。通过分析跟踪的屏幕区域的图像内容，我们的系统能够检测幻灯片进度，并为每张幻灯片提取高质量，无遮挡，几何补偿的图像，从而生成可重建主要演示文稿的代表性图像列表结构体。之后，我们的系统识别文本内容并从幻灯片中提取关键字，这些关键字可用于基于关键字的视频检索和浏览。实验结果表明，与常用的对象跟踪方法相比，我们的系统能够生成更稳定，更准确的屏幕定位结果。对于这种特定类型的视频，我们的系统还提取了比常规视频摘要方法更准确的表示结构。

著录项

来源
《Pattern Analysis and Machine Intelligence, IEEE Transactions on》 |2015年第6期|1233-1246|共14页
作者
Li Kai; Wang Jue; Wang Haoqian; Dai Qionghai;
展开▼
作者单位

Department of Automation, Tsinghua National Laboratory for Information Science and Technology (TNList), Tsinghua University, Beijing, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Cameras; Educational institutions; Feature extraction; Semantics; Trajectory; Videos; Visualization; Lecture video; presentation video; projection screen localization; video structuring; video summarization;

机译：相机;教育机构;特征提取;语义;轨迹;视频;可视化;讲座视频;演示视频;投影屏定位;视频结构;视频摘要;

相似文献

外文文献
中文文献
专利

1. Automatic Video Recording of Lecture's Audience with Activity Analysis and Equalization of Scale for Students Observation [J] . Satoshi Nishiguchi, Yoshinari Kameda, Koh Kakusho, Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2004,第2期

机译：讲座的观众自动视频录制，具有活动分析和比例均衡的学生观察
2. Structuring low-quality videotaped lectures for cross-reference browsing by video text analysis [J] . Wang F, Ngo CW, Pong TC Pattern Recognition: The Journal of the Pattern Recognition Society . 2008,第10期

机译：通过视频文本分析构建低质量的录像演讲课，以供交叉引用浏览
3. PERFORMANCE AND ANALYSIS OF AUTOMATIC LICENSE PLATE LOCALIZATION AND RECOGNITION FROM VIDEO SEQUENCES [J] . B.Thamilvalluvan, Priyanka Paree Alphonse, D.R.Thendralarasi, International Journal on Smart Sensing and Intelligent Systems . 2017,第SPECIALaISSUE期

机译：视频序列自动牌照本地化和识别的性能与分析
4. New optical designs for large-screen two- and three-dimensional video projection with enhanced screen brightness and no visible pixel or line structure [C] . Eugene Dolgoff, Projectavision, Inc., Projection Displays . 1995

机译：用于大屏幕二维和三维视频投影的新光学设计，具有增强的屏幕亮度且无可见像素或线条结构
5. A Makeover for the Captured Lecture: Applying Multimedia Learning Principles to Lecture Video [D] . Lamb, Richard Alan 2015

机译：捕获讲座的改造：将多媒体学习原则应用于讲座视频
6. Two step convolutional neural network for automatic glottis localization and segmentation in stroboscopic videos [O] . Varun Belagali, Achuth Rao M V, Pebbili Gopikishore, 2020

机译：用于自动发光本地化和频闪视频分割的两步卷积神经网络
7. Temporal HeartNet: Towards Human-Level Automatic Analysis of Fetal Cardiac Screening Video [O] . Huang, W, Bridge, CP, Noble, JA, 2017

机译：Temporal HeartNet：走向人类水平的胎儿心脏筛查视频自动分析

Structuring Lecture Videos by Automatic Projection Screen Localization and Analysis

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅