首页> 外文期刊>IEEE transactions on multimedia >Summarization of Visual Content in Instructional Videos
【24h】

Summarization of Visual Content in Instructional Videos

机译:教学视频中视觉内容的概述

获取原文
获取原文并翻译 | 示例
           

摘要

In instructional videos of chalk board presentations, the visual content refers to the text and figures written on the boards. Existing methods on video summarization are not effective for this video domain because they are mainly based on low-level image features such as color and edges. In this work, we present a novel approach to summarizing the visual content in instructional videos using middle-level features. We first develop a robust algorithm to extract content text and figures from instructional videos by statistical modelling and clustering. This algorithm addresses the image noise, nonuniformity of the board regions, camera movements, occlusions, and other challenges in the instructional videos that are recorded in real classrooms. Using the extracted text and figures as the middle level features, we retrieve a set of key frames that contain most of the visual content. We further reduce content redundancy and build a mosaicked summary image by matching extracted content based on K-th Hausdorff distance and connected component decomposition. Performance evaluation on four full-length instructional videos shows that our algorithm is highly effective in summarizing instructional video content.
机译:在粉笔板演示文稿的教学视频中,视觉内容是指书写在板上的文字和图形。现有的视频汇总方法对于该视频领域无效,因为它们主要基于低级图像特征(例如颜色和边缘)。在这项工作中,我们提出了一种新颖的方法,可以使用中级功能总结教学视频中的视觉内容。我们首先开发了一种强大的算法,可以通过统计建模和聚类从教学视频中提取内容文本和图形。该算法解决了图像噪声,木板区域的不均匀性,相机移动,遮挡以及在真实教室中录制的教学视频中的其他挑战。使用提取的文本和图形作为中间层功能,我们检索了一组包含大部分视觉内容的关键帧。我们进一步减少内容冗余,并通过基于第K个Hausdorff距离和连接的分量分解匹配提取的内容来构建镶嵌的摘要图像。对四个全长教学视频的性能评估表明,我们的算法在总结教学视频内容方面非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号