Video retrieval of near-duplicates using k-nearest neighbor retrieval of spatio-temporal descriptors

Daniel DeMenthon; David Doermann

首页> 外文期刊>Multimedia Tools and Applications >Video retrieval of near-duplicates using k-nearest neighbor retrieval of spatio-temporal descriptors

【24h】

Video retrieval of near-duplicates using k-nearest neighbor retrieval of spatio-temporal descriptors

机译：使用时空描述符的k最近邻检索进行近乎重复的视频检索

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes a novel methodology for implementing video search functions such as retrieval of near-duplicate videos and recognition of actions in surveillance video. Videos are divided into half-second clips whose stacked frames produce 3D space-time volumes of pixels. Pixel regions with consistent color and motion properties are extracted from these 3D volumes by a threshold-free hierarchical space-time segmentation technique. Each region is then described by a high-dimensional point whose components represent the position, orientation and, when possible, color of the region. In the indexing phase for a video database, these points are assigned labels that specify their video clip of origin. All the labeled points for all the clips are stored into a single binary tree for efficient k-nearest neighbor retrieval. The retrieval phase uses video segments as queries. Half-second clips of these queries are again segmented by space-time segmentation to produce sets of points, and for each point the labels of its nearest neighbors are retrieved. The labels that receive the largest numbers of votes correspond to the database clips that are the most similar to the query video segment. We illustrate this approach for video indexing and retrieval and for action recognition. First, we describe retrieval experiments for dynamic logos, and for video queries that differ from the indexed broadcasts by the addition of large overlays. Then we describe experiments in which office actions (such as pulling and closing drawers, taking and storing items, picking up and putting down a phone) are recognized. Color information is ignored to insure independence of action recognition to people's appearance. One of the distinct advantages of using this approach for action recognition is that there is no need for detection or recognition of body parts.

机译：本文介绍了一种用于实现视频搜索功能的新颖方法，例如检索近重复的视频以及识别监视视频中的动作。视频被分为半秒的剪辑，其堆叠的帧产生3D时空像素。通过无阈值分层时空分割技术从这些3D体积中提取具有一致颜色和运动属性的像素区域。然后，每个区域都由一个高维点来描述，该高维点的成分代表该区域的位置，方向以及颜色（如果可能）。在视频数据库的索引阶段，为这些点分配了标签，这些标签指定了它们的视频原始片段。所有剪辑的所有标记点都存储在单个二叉树中，以进行有效的k最近邻检索。检索阶段将视频片段用作查询。这些查询的半秒片段再次通过时空分割进行分割，以生成点集，并为每个点检索其最近邻居的标签。获得最多票数的标签对应于与查询视频片段最相似的数据库剪辑。我们说明了这种用于视频索引和检索以及动作识别的方法。首先，我们描述了动态徽标的检索实验，以及通过添加大的覆盖层而不同于索引广播的视频查询的检索实验。然后，我们描述了可以识别办公动作（例如拉动和合上抽屉，拿起和存放物品，拿起和放下电话）的实验。颜色信息被忽略以确保动作识别与人的外观无关。使用这种方法进行动作识别的独特优势之一是不需要检测或识别身体部位。

著录项

来源
《Multimedia Tools and Applications》 |2006年第3期|p.229-253|共25页
作者
Daniel DeMenthon; David Doermann;
展开▼
作者单位

Language and Media Processing (LAMP), University of Maryland Institute for Advanced Computer Studies (UMIACS), College Park, MD 20742, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
content-based indexing and retrieval; video retrieval of near-duplicates; action recognition; space-time segmentation; spatio-temporal descriptors; object motion;

机译：基于内容的索引和检索;近重复视频检索;动作识别;时空分割;时空描述符;物体运动;

相似文献

外文文献
中文文献
专利

1. Content-Based Image Retrieval Using Color Layout Descriptor, Gray-Level Co-Occurrence Matrix and K-Nearest Neighbors [J] . Farhan Sadique, S M Rafizul Haque International Journal of Information Technology and Computer Science . 2020,第3期

机译：基于内容的图像检索使用颜色布局描述符，灰度级共生矩阵和k离邻居
2. Pattern-Based Near-Duplicate Video Retrieval and Localization on Web-Scale Videos [J] . Chien-Li Chou, Hua-Tsung Chen, Suh-Yin Lee Multimedia, IEEE Transactions on . 2015,第3期

机译：Web模式视频上基于模式的近重复视频检索和本地化
3. Correlation-Based Retrieval for Heavily Changed Near-Duplicate Videos [J] . JIAJUN LIU, Zl HUANG, HENG TAO SHEN, ACM Transactions on Information Systems . 2011,第4期

机译：大量更改的近重复视频的基于相关性的检索
4. PROBABILISTIC APPROACH TO K-NEAREST NEIGHBOR VIDEO RETRIEVAL [C] . Nai-xiang Lion, Yap-Peng Tan IEEE International Symposium on Circuits and Systems . 2004

机译：k离邻邻视频检索的概率方法
5. Spatio-temporal visual information analysis for moving object detection and retrieval in video sequences. [D] . Liu, Dianting. 2013

机译：时空视觉信息分析，用于视频序列中的运动对象检测和检索。
6. Large Scale Near-Duplicate Celebrity Web Images Retrieval Using Visual and Textual Features [O] . Fengcai Qiao, Cheng Wang, Xin Zhang, 2013

机译：使用视觉和文字功能进行大规模近乎重复的名人Web图像检索
7. Video Retrieval Of Near-Duplicates Using k-Nearest Neighbor Retrieval Of Spatio-Temporal Descriptors [O] . Daniel Dementhon 2005

机译：利用时空描述符的k-最近邻检索进行近似重复的视频检索

Video retrieval of near-duplicates using k-nearest neighbor retrieval of spatio-temporal descriptors

摘要

著录项

相似文献

相关主题

期刊订阅