首页> 外文会议>Multimedia Content Analysis, Management, and Retrieval 2006 >A Tree-based Paradigm for Content-based Video Retrieval and Management
【24h】

A Tree-based Paradigm for Content-based Video Retrieval and Management

机译:基于树的范例,用于基于内容的视频检索和管理

获取原文
获取原文并翻译 | 示例

摘要

As video databases become increasingly important for full exploitation of multimedia resources, this paper aims at describing our recent efforts in feasibility studies towards building up a content-based and high-level video retrieval/management system. The study is focused on constructing a semantic tree structure via combination of low-level image processing techniques and high-level interpretation of visual content. Specifically, two separate algorithms were developed to organise input videos in terms of two layers: the shot layer and the key-frame layer. While the shot layer is derived by developing a multi-featured shot cut detection, the key frame layer is extracted automatically by a genetic algorithm. This paves the way for applying pattern recognition techniques to analyse those key frames and thus extract high level information to interpret the visual content or objects. Correspondingly, content-based video retrieval can be conducted in three stages. The first stage is to browse the digital video via the semantic tree at structural level, the second stage is match the key frame in terms of low-level features, such as colour, shape of objects, and texture etc. Finally, the third stage is to match the high-level information, such as conversation with indoor background, moving vehicles along a seaside road etc. Extensive experiments are reported in this paper for shot cut detection and key frame extraction, enabling the tree structure to be constructed.
机译:随着视频数据库对于充分利用多媒体资源变得越来越重要,本文旨在描述我们最近在进行可行性研究以构建基于内容的高级视频检索/管理系统方面所做的努力。该研究的重点是通过低级图像处理技术和可视内容的高级解释相结合来构造语义树结构。具体来说,开发了两种单独的算法来按两层组织输入视频:镜头层和关键帧层。通过开发多功能的镜头剪切检测来导出镜头层,而关键帧层则通过遗传算法自动提取。这为应用模式识别技术分析那些关键帧并提取高级信息以解释视觉内容或对象铺平了道路。相应地,基于内容的视频检索可以分三个阶段进行。第一阶段是在结构级别上通过语义树浏览数字视频,第二阶段是在关键级别上匹配低级特征,例如颜色,对象的形状和纹理等。最后,第三阶段为了匹配高层信息,例如与室内背景的对话,沿着海边道路行驶的车辆等。本文报道了用于镜头剪切检测和关键帧提取的大量实验,从而可以构建树结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号