Retina-Enhanced SURF Descriptors for Semantic Concept Detection in Videos

机译：视网膜增强的冲浪描述符，用于视频中的语义概念检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes to investigate the potential benefit of the use of low-level human vision behaviors in the context of high-level semantic concept detection. A large part of the current approaches relies on the Bag-of-Words (BoW) model, which has proven itself to be a good choice especially for object recognition in images. Its extension from static images to video sequences exhibits some new problems to cope with, mainly the way to use the added temporal dimension for detecting the target concepts (swimming, drinking...). In this study, we propose to apply a human retina model to preprocess video sequences, before constructing a State-Of-The-Art BoW analysis. This preprocessing, designed in a way that enhances the appearance especially of static image elements, increases the performance by introducing robustness to traditional image and video problems, such as luminance variation, shadows, compression artifacts and noise. These approaches are evaluated on the TrecVid 2010 Semantic Indexing task datasets, containing 130 high-level semantic concepts. We consider the well-known SURF descriptor as the entry point of the BoW system, but this work could be extended to any other local gradient based descriptor.

机译：本文建议调查在高级别语义概念检测的背景下使用低级人类视力行为的潜在好处。当前方法的大部分方法依赖于单词袋（弓）模型，这已被证明是一种良好的选择，特别是在图像中的对象识别。它从静态图像到视频序列的扩展表现出一些新的问题，以应对一些新的问题，主要是使用增加的时间维度来检测目标概念（游泳，饮酒......）。在这项研究中，我们建议在构建最先进的弓分析之前将人视网膜模型应用于预处理视频序列。这种预处理，以一种增强静态图像元素的外观的方式设计，通过向传统图像和视频问题引入鲁棒性，例如亮度变化，阴影，压缩伪像和噪声来增加性能。在Trecvid 2010语义索引任务数据集上评估这些方法，其中包含130个高级语义概念。我们将众所周知的冲浪描述符作为弓系统的入口点，但是这项工作可以扩展到任何其他基于梯度的基于梯度的描述符。

著录项

来源
《IPTA 2012;International Conference on Image Processing Theory, Tools and Applications》|2012年||共6页
会议地点
作者
Sabin Tiberius STRAT; Alexandre BENOIT; Patrick LAMBERT; Alice CAPLIER;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-532;
关键词
Bag of words; Retina analysis; Retina preprocessing; Semantics; SURF; Video content; Video indexation;

机译：袋子;视网膜分析;视网膜预处理;语义;冲浪;视频内容;视频索引;

相似文献

外文文献
中文文献
专利

1. Semantic Video Concept Detection using Subspace-partition based Scheme - TRECVid 2012 Semantic Video Concept Detection by NTT [J] . Yongqing SUN, Kyoko SUDO, Yukinobu TANIGUCHI, 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2012,第441期

机译：使用基于子空间分区的方案进行语义视频概念检测-TRECVid 2012 NTT进行语义视频概念检测
2. Multi-label semantic concept detection in videos using fusion of asymmetrically trained deep convolutional neural networks and foreground driven concept co-occurrence matrix [J] . Janwe Nitin J., Bhoyar Kishor K. Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2018,第8期

机译：使用非对称训练的深卷积神经网络和前景驱动概念共发生矩阵的视频中的多标签语义概念检测
3. Near-Duplicate Video Clip Detection Using Model-Free Semantic Concept Detection and Adaptive Semantic Distance Measurement [J] . Min H.-S., Choi J. Y., De Neve W., Circuits and Systems for Video Technology, IEEE Transactions on . 2012,第8期

机译：使用无模型语义概念检测和自适应语义距离测量的近重复视频剪辑检测
4. Retina-enhanced SURF descriptors for semantic concept detection in videos [C] . Strat Sabin Tiberius, Benoit Alexandre, Lambert Patrick, 2012 3rd International Conference on Image Processing Theory, Tools and Applications. . 2012

机译：视网膜增强型SURF描述符，用于视频中的语义概念检测
5. Advanced techniques for semantic concept detection in general videos. [D] . Jiang, Wei. 2010

机译：普通视频中语义概念检测的高级技术。
6. Semantic based concept differential retrieval equivalence detection in clinical terms version 3 (Read Codes). [O] . P. J. Brown, C. Price 1999

机译：临床术语第3版（阅读代码）中基于语义的概念差异检索和对等检测。
7. Retina-Enhanced SURF Descriptors for Semantic Concept Detection in Videos [O] . Strat, Tiberius, Benoit, Alexandre, Lambert, Patrick, 2012

机译：用于视频中语义概念检测的视网膜增强型SURF描述符

Retina-Enhanced SURF Descriptors for Semantic Concept Detection in Videos

摘要

著录项

相似文献

相关主题

期刊订阅