...
首页> 外文期刊>Computer vision and image understanding >A multi-modal system for the retrieval of semantic video events
【24h】

A multi-modal system for the retrieval of semantic video events

机译:用于检索语义视频事件的多模式系统

获取原文
获取原文并翻译 | 示例
           

摘要

A framework for event detection is proposed where events, objects, and other semantic concepts are detected from video using trained classifiers. These classifiers are used to automatically annotate video with semantic labels, which in turn are used to search for new, untrained types of events and semantic concepts. The novelty of the approach lies in the: (1) semi-automatic construction of models of events from feature descriptors and (2) integration of content-based and concept-based querying in the search process. Speech retrieval is independently applied and combined results are produced. Results of applying these to the Search benchmark of the NIST TREC Video track 2001 are reported, and the gained experience and future work are discussed. (C) 2004 Published by Elsevier Inc.
机译:提出了一种事件检测框架,其中使用训练有素的分类器从视频中检测事件,对象和其他语义概念。这些分类器用于使用语义标签自动注释视频,而语义标签又用于搜索新的,未经训练的事件和语义概念。该方法的新颖之处在于:(1)来自特征描述符的事件模型的半自动构建,以及(2)在搜索过程中集成基于内容和基于概念的查询。语音检索是独立应用的,并且会产生组合结果。报告了将这些结果应用到NIST TREC Video track 2001的Search基准中的结果,并讨论了获得的经验和未来的工作。 (C)2004由Elsevier Inc.出版

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号