
Modeling geometric-temporal context with directional pyramid co-occurrence for action recognition


Abstract

In this paper, we present a new geometric-temporal representation for visual action recognition based on local spatio-temporal features. First, we propose a modified covariance descriptor under the log-Euclidean Riemannian metric to represent the spatio-temporal cuboids detected in the video sequences. Compared with previously proposed covariance descriptors, our descriptor can be measured and clustered in Euclidean space. Second, to capture the geometric-temporal contextual information, we construct a directional pyramid co-occurrence matrix (DPCM) to describe the spatio-temporal distribution of the vector-quantized local feature descriptors extracted from a video. DPCM characterizes the co-occurrence statistics of local features as well as the spatio-temporal positional relationships among the concurrent features. These statistics provide strong descriptive power for action recognition. To use DPCM for action recognition, we propose a directional pyramid co-occurrence matching kernel to measure the similarity of videos. The proposed method achieves state-of-the-art performance and improves on the recognition performance of bag-of-visual-words (BOVW) models by a large margin on six public data sets. For example, on the KTH data set it achieves 98.78% accuracy, while the BOVW approach achieves only 88.06%. On both the Weizmann and UCF CIL data sets, the highest possible accuracy of 100% is achieved.
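The property that makes the modified covariance descriptor convenient is the log-Euclidean Riemannian metric: a symmetric positive-definite covariance matrix is mapped through the matrix logarithm, after which it can be treated as an ordinary Euclidean vector, so standard distances and k-means clustering apply directly. The sketch below illustrates only this mapping, not the paper's full descriptor; the feature dimensions, the helper name log_euclidean_embedding, and the toy data are assumptions made purely for illustration.

import numpy as np
from scipy.linalg import logm

def log_euclidean_embedding(cov, eps=1e-6):
    # Regularize so the covariance matrix stays strictly positive definite,
    # then take its matrix logarithm (the log-Euclidean mapping).
    c = cov + eps * np.eye(cov.shape[0])
    log_c = np.real(logm(c))
    # Vectorize the symmetric log-matrix: diagonal entries as-is, off-diagonal
    # entries scaled by sqrt(2) so the Euclidean norm of the vector equals the
    # Frobenius norm of log(C). Plain Euclidean distance and k-means then apply.
    iu = np.triu_indices(c.shape[0], k=1)
    return np.concatenate([np.diag(log_c), np.sqrt(2.0) * log_c[iu]])

# Toy usage: compare two random "cuboid" feature sets (50 samples x 7 features
# each, stand-ins for the per-pixel features inside a spatio-temporal cuboid).
rng = np.random.default_rng(0)
a = rng.normal(size=(50, 7))
b = rng.normal(size=(50, 7))
d = np.linalg.norm(log_euclidean_embedding(np.cov(a.T)) -
                   log_euclidean_embedding(np.cov(b.T)))
print(f"log-Euclidean distance between descriptors: {d:.3f}")

Clustering such embedded vectors is what yields the vector-quantized codewords whose spatio-temporal co-occurrences the DPCM then accumulates.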
