Video Tomographs and a Base Detector Selection Strategy for Improving Large-Scale Video Concept Detection

Sidiropoulos P.; Mezaris V.; Kompatsiaris I.

首页> 外文期刊>Circuits and Systems for Video Technology, IEEE Transactions on >Video Tomographs and a Base Detector Selection Strategy for Improving Large-Scale Video Concept Detection

【24h】

Video Tomographs and a Base Detector Selection Strategy for Improving Large-Scale Video Concept Detection

机译：视频断层扫描仪和基础探测器选择策略，用于改善大规模视频概念检测

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we deal with the problem of video concept detection to use the concept detection results toward a more effective concept-based video retrieval. The key novelties of this paper are as follows: 1) the use of spatio-temporal video slices (tomographs) in the same way that visual keyframes are typically used in video concept detection schemes. These spatio-temporal slices capture in a compact way motion patterns that are useful for detecting semantic concepts and are used for training a number of base detectors. The latter augment the set of keyframe-based base detectors that can be trained using different frame representations. 2) The introduction of a generic methodology, built upon a genetic algorithm, for controlling which subset of the available base detectors (consequently, which subset of the possible shot representations) should be combined for developing an optimal detector for each specific concept. This methodology is directly applicable to the learning of hundreds of diverse concepts, while diverging from the one-size-fits-all approach that is typically used in problems of this size. The proposed techniques are evaluated on the datasets of the 2011 and 2012 Semantic Indexing Task of TRECVID, each comprising several hundred hours of heterogeneous video clips and ground-truth annotations for tens of concepts that exhibit significant variation in terms of generality, complexity, and human participation. The experimental results manifest the merit of the proposed techniques.

机译：在本文中，我们处理视频概念检测的问题，以将概念检测结果用于更有效的基于概念的视频检索。本文的主要新颖之处如下：1）时空视频切片（断层图）的使用与视频概念检测方案中通常使用视觉关键帧的方式相同。这些时空切片以紧凑的方式捕获了运动模式，这些运动模式对于检测语义概念很有用，并用于训练许多基本检测器。后者增强了基于关键帧的基础检测器的集合，可以使用不同的帧表示对其进行训练。 2）引入了一种基于遗传算法的通用方法，用于控制可用基本检测器的哪个子集（因此，可能的镜头表示的哪个子集）应结合起来，以便为每个特定概念开发最佳检测器。这种方法直接适用于数百种不同概念的学习，而不同于通常用于这种规模问题的“一刀切”的方法。在2011年和2012年TRECVID语义索引任务的数据集上对提出的技术进行了评估，每个数据集包含数百小时的异类视频剪辑和真实的注释，这些概念的数十个概念在通用性，复杂性和人性方面均表现出很大差异参与。实验结果证明了所提出技术的优点。

著录项

来源
《Circuits and Systems for Video Technology, IEEE Transactions on》 |2014年第7期|1251-1264|共14页
作者
Sidiropoulos P.; Mezaris V.; Kompatsiaris I.;
展开▼
作者单位

Mullard Space Science Laboratory, University College London, London, U.K.|c|;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Feature extraction; genetic algorithms; image sequence analysis; machine learning algorithms; video concept detection; video signal processing;

机译：特征提取;遗传算法;图像序列分析;机器学习算法;视频概念检测;视频信号处理;

相似文献

外文文献
中文文献
专利

1. Semantic Video Concept Detection using Subspace-partition based Scheme - TRECVid 2012 Semantic Video Concept Detection by NTT [J] . Yongqing SUN, Kyoko SUDO, Yukinobu TANIGUCHI, 電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding . 2012,第441期

机译：使用基于子空间分区的方案进行语义视频概念检测-TRECVid 2012 NTT进行语义视频概念检测
2. The equipment detection and localization of large-scale construction jobsite by far-field construction surveillance video based on improving YOLOv3 and grey wolf optimizer improving extreme learning machine [J] . Zeng Tuocheng, Wang Jiajun, Cui Bo, Construction and Building Materials . 2021,第Jula12期

机译：基于改进Yolov3和灰狼优化改善极限学习机的大型施工监测视频的大型建筑职位的设备检测与定位
3. Simulating the future of concept-based video retrieval under improved detector performance [J] . Robin Aly, Djoerd Hiemstra, Franciska de Jong, Multimedia Tools and Applications . 2012,第1期

机译：在改进的检测器性能下模拟基于概念的视频检索的未来
4. Tomograph based concept detection of video data using PSO [C] . Priyanka Lall, Swati Sorte . 2017

机译：使用PSO的基于断层扫描技术的视频数据概念检测
5. Concept-based large-scale video database browsing and retrieval via visualization. [D] . Luo, Hangzai. 2007

机译：基于概念的大型视频数据库浏览和可视化检索。
6. Efficient Video Panoramic Image Stitching Based on an Improved Selection of Harris Corners and a Multiple-Constraint Corner Matching [O] . Minchen Zhu, Weizhi Wang, Binghan Liu, -1

机译：基于改进的Harris角选择和多约束角匹配的高效视频全景图像拼接
7. Two-Layers re-Ranking Approach based on Contextual Information for Visual Concepts Detection in Videos [O] . Hamadi, Abdelkader, Quénot, Georges, Mulhem, Philippe 2012

机译：基于上下文信息的两层重排方法在视频视觉概念检测中的应用
8. Improving Stop Line Detection Using Video Imaging Detectors [R] . 2010

机译：使用视频成像探测器改善停止线检测

Video Tomographs and a Base Detector Selection Strategy for Improving Large-Scale Video Concept Detection

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅