
Multimodal Video Database Modeling, Querying and Browsing


Abstract

This paper presents MMVIRS, a multimodal video indexing and retrieval system. MMVIRS models the auditory, visual, and textual sources of video collections from a semantic perspective. Beyond multimodality, our model is built on semantic hierarchies that allow video to be accessed at different semantic levels. MMVIRS has been implemented with data annotation, querying, and browsing parts. In the annotation part, metadata and video semantics are extracted hierarchically. In the querying part, semantic, spatial, regional, spatio-temporal, and temporal queries are processed over video collections using the proposed model. In the browsing part, video collections are navigated using category information and the visual and auditory hierarchies.
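The abstract does not describe the data model itself, so the following is only a minimal sketch of how hierarchical, multimodal annotations with semantic and temporal queries might look. Every name here (`Annotation`, `semantic_query`, `temporal_query`, the sample labels and timings) is a hypothetical illustration, not part of MMVIRS.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional, Tuple


@dataclass
class Annotation:
    """Hypothetical node in a semantic hierarchy over one video."""
    modality: str   # assumed values: "visual", "auditory", "textual"
    label: str      # semantic label attached by the annotator
    start: float    # interval start, in seconds
    end: float      # interval end, in seconds
    region: Optional[Tuple[float, float, float, float]] = None  # x, y, w, h
    children: List["Annotation"] = field(default_factory=list)

    def walk(self) -> Iterator["Annotation"]:
        """Yield this node and all descendants, depth first."""
        yield self
        for child in self.children:
            yield from child.walk()


def semantic_query(root: Annotation, label: str,
                   modality: Optional[str] = None) -> List[Annotation]:
    """Return nodes with the given label, optionally limited to one modality."""
    return [a for a in root.walk()
            if a.label == label and (modality is None or a.modality == modality)]


def temporal_query(root: Annotation, start: float, end: float) -> List[Annotation]:
    """Return nodes whose interval overlaps the open interval (start, end)."""
    return [a for a in root.walk() if a.start < end and start < a.end]


# Invented sample data: a two-level hierarchy mixing visual and auditory nodes.
video = Annotation("visual", "news broadcast", 0.0, 120.0, children=[
    Annotation("visual", "studio segment", 0.0, 45.0, children=[
        Annotation("visual", "anchor", 0.0, 45.0, region=(0.3, 0.2, 0.4, 0.6)),
        Annotation("auditory", "speech", 2.0, 44.0),
    ]),
    Annotation("visual", "field report", 45.0, 120.0, children=[
        Annotation("auditory", "music", 45.0, 50.0),
    ]),
])

speech_hits = semantic_query(video, "speech", modality="auditory")
overlap_hits = temporal_query(video, 46.0, 48.0)
```

Storing annotations as a tree keeps coarse and fine semantic levels in one structure, so the same traversal can serve both querying and level-by-level browsing. A spatial or regional query could filter on `region` in the same way.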
