首页> 外文会议>IPTA 2012;International Conference on Image Processing Theory, Tools and Applications >Retina-Enhanced SURF Descriptors for Semantic Concept Detection in Videos
【24h】

Retina-Enhanced SURF Descriptors for Semantic Concept Detection in Videos

机译:视网膜增强的冲浪描述符,用于视频中的语义概念检测

获取原文

摘要

This paper proposes to investigate the potential benefit of the use of low-level human vision behaviors in the context of high-level semantic concept detection. A large part of the current approaches relies on the Bag-of-Words (BoW) model, which has proven itself to be a good choice especially for object recognition in images. Its extension from static images to video sequences exhibits some new problems to cope with, mainly the way to use the added temporal dimension for detecting the target concepts (swimming, drinking...). In this study, we propose to apply a human retina model to preprocess video sequences, before constructing a State-Of-The-Art BoW analysis. This preprocessing, designed in a way that enhances the appearance especially of static image elements, increases the performance by introducing robustness to traditional image and video problems, such as luminance variation, shadows, compression artifacts and noise. These approaches are evaluated on the TrecVid 2010 Semantic Indexing task datasets, containing 130 high-level semantic concepts. We consider the well-known SURF descriptor as the entry point of the BoW system, but this work could be extended to any other local gradient based descriptor.
机译:本文建议调查在高级别语义概念检测的背景下使用低级人类视力行为的潜在好处。当前方法的大部分方法依赖于单词袋(弓)模型,这已被证明是一种良好的选择,特别是在图像中的对象识别。它从静态图像到视频序列的扩展表现出一些新的问题,以应对一些新的问题,主要是使用增加的时间维度来检测目标概念(游泳,饮酒......)。在这项研究中,我们建议在构建最先进的弓分析之前将人视网膜模型应用于预处理视频序列。这种预处理,以一种增强静态图像元素的外观的方式设计,通过向传统图像和视频问题引入鲁棒性,例如亮度变化,阴影,压缩伪像和噪声来增加性能。在Trecvid 2010语义索引任务数据集上评估这些方法,其中包含130个高级语义概念。我们将众所周知的冲浪描述符作为弓系统的入口点,但是这项工作可以扩展到任何其他基于梯度的基于梯度的描述符。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号