【24h】

Video retrieval: content analysis by ImageMiner

机译:视频检索:ImageMiner的内容分析

获取原文

摘要

Abstract: In this paper videos are analyzed to get a content-based description of the video. The structure of a given video is useful to index long videos efficiently and automatically. A comparison between shots gives an overview about cut frequency, cut pattern, and scene bounds. After a shot detection the shots are grouped into clusters based on their visual similarity. A time-constraint clustering procedure is used to compare only those shots that are positioned inside a time range. Shots from different areas of the video (e.g., begin/end) are not compared. With this cluster information that contains a list about shots and their clusters it is possible to calculate scene bounds. A labeling of all clusters gives a declaration about the cut pattern. It is easy now to distinguish a dialogue from an action scene. The final content analysis is done by the ImageMiner$+TM$/ system. The ImageMiner system developed at the University of Bremen of the Image Processing Department of the Center for Computing Technology realizes content-based image retrieval for still images through a novel combination of methods and techniques of computer vision and artificial intelligence. The ImageMiner system consists of three analysis modules for computer vision, namely for color, texture, and contour analysis. Additionally exists a module for object recognition. The output of the object recognition module can be indexed by a text retrieval system. Thus, concepts like forestscene may be searched for. We combine the still image analysis with the results of the video analysis in order to retrieve shots or scenes. !30
机译:摘要:本文对视频进行了分析,以获取基于内容的视频描述。给定视频的结构对于有效且自动地将长视频编入索引很有用。镜头之间的比较概述了剪切频率,剪切模式和场景范围。镜头检测后,根据它们的视觉相似性将镜头分组。时间约束聚类过程仅用于比较位于时间范围内的那些镜头。不比较来自视频不同区域(例如开始/结束)的镜头。使用包含有关镜头及其簇的列表的此簇信息,可以计算场景范围。所有簇的标签给出了有关切割模式的声明。现在很容易将对话与动作场景区分开。最终的内容分析由ImageMiner $ + TM $ /系统完成。由计算机技术中心不来梅大学图像处理系开发的ImageMiner系统通过计算机视觉和人工智能的方法和技术的新颖结合,实现了基于内容的静止图像图像检索。 ImageMiner系统包含用于计算机视觉的三个分析模块,即颜色,纹理和轮廓分析。此外,还存在一个用于对象识别的模块。对象识别模块的输出可以由文本检索系统建立索引。因此,可以搜索诸如森林场景的概念。我们将静止图像分析与视频分析结果结合在一起,以检索镜头或场景。 !30

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号