An evaluation of bags-of-words and spatio-temporal shapes for action recognition

机译：评估词袋和时空形状以进行动作识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Bags-of-visual-Words (BoW) and Spatio-Temporal Shapes (STS) are two very popular approaches for action recognition from video. The former (BoW) is an un-structured global representation of videos which is built using a large set of local features. The latter (STS) uses a single feature located on a region of interest (where the actor is) in the video. Despite the popularity of these methods, no comparison between them has been done. Also, given that BoW and STS differ intrinsically in terms of context inclusion and globality/locality of operation, an appropriate evaluation framework has to be designed carefully. This paper compares these two approaches using four different datasets with varied degree of space-time specificity of the actions and varied relevance of the contextual background. We use the same local feature extraction method and the same classifier for both approaches. Further to BoW and STS, we also evaluated novel variations of BoW constrained in time or space. We observe that the STS approach leads to better results in all datasets whose background is of little relevance to action classification.

机译：视觉文字袋（BoW）和时空形状（STS）是从视频中识别动作的两种非常流行的方法。前者（BoW）是使用大量本地功能构建的视频的非结构化全局表示。后者（STS）使用位于视频中感兴趣区域（演员所在的区域）的单个功能。尽管这些方法很流行，但它们之间没有进行比较。同样，鉴于BoW和STS在上下文包容性和运营的全局性/本地性方面存在本质上的差异，因此必须仔细设计合适的评估框架。本文使用四种不同的数据集对这两种方法进行了比较，这些数据集具有不同程度的动作时空特异性和背景相关性。对于这两种方法，我们使用相同的局部特征提取方法和相同的分类器。除了BoW和STS，我们还评估了受时间或空间限制的BoW的新颖变化。我们观察到，STS方法在所有背景与动作分类无关的数据集中产生了更好的结果。

著录项

来源
《2011 IEEE Workshop on Applications of Computer Vision》|2011年|p.344-351|共8页
会议地点
作者

展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类模式识别与装置;
关键词

相似文献

外文文献
中文文献
专利

1. Evaluating a bag-of-visual features approach using spatio-temporal features for action recognition [J] . Nazir Saima, Yousaf Muhammad Haroon, Velastin Sergio A. Computers and Electrical Engineering . 2018,第期

机译：使用用于动作识别的时空特征来评估一种可视化特征方法
2. Fusing shape and spatio-temporal features for depth-based dynamic hand gesture recognition [J] . Zheng Jinqing, Feng Zhiyong, Xu Chao, Multimedia Tools and Applications . 2017,第20期

机译：融合形状和时空特征以实现基于深度的动态手势识别
3. Extended Evaluation of XZ-Shape Histogram for Human-Object Interaction Activity Recognition based on Kinect-like Depth Image [J] . M. A. ASARI, U. U. SHEIKH, A. H. OMAR, WSEAS Transactions on Signal Processing . 2016,第Null期

机译：基于Kinect深度图像的XZ形直方图对人-物体交互活动识别的扩展评估
4. An evaluation of bags-of-words and spatio-temporal shapes for action recognition [C] . {missing} IEEE Workshop on Applications of Computer Vision . 2011

机译：对动作识别的袋式和时空形状的评估
5. Probabilistic Shape Parsing and Action Recognition Through Binary Spatio-Temporal Feature Description. [D] . Whiten, Christopher James. 2013

机译：通过二进制时空特征描述的概率形状解析和动作识别。
6. Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing [O] . Pau Climent-Pérez, Francisco Florez-Revuelta 2021

机译：使用替代骨骼和视频预处理改进了可分离的时空关注的动作识别
7. An evaluation of bags-of-words and spatio-temporal shapes for action recognition [O] . De Campos T, Barnard M, Mikolajczyk K, 2011

机译：用于动作识别的词袋和时空形状的评估

An evaluation of bags-of-words and spatio-temporal shapes for action recognition

摘要

著录项

相似文献

相关主题

期刊订阅