Sparse coding-based space-time video representation for action recognition

Fu Yinghua; Zhang Tao; Wang Wenjin

首页> 外文期刊>Multimedia Tools and Applications >Sparse coding-based space-time video representation for action recognition

【24h】

Sparse coding-based space-time video representation for action recognition

机译：基于稀疏编码的时空视频表示，用于动作识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Methods based on feature descriptors around local interest points are now widely used in action recognition. Feature points are detected using a number of measures, namely saliency, periodicity, motion activity etc. Each of these measures is usually intensity-based and provides a trade-off between density and informativeness. In this paper, we address the problem of action recognition by representing image sequences as a sparse collection of patch-level space-time events that are salient in both space and time domain. Our method uses a multi-scale volumetric representation of video and adaptively selects an optimal space-time scale under which the saliency of a patch is most significant. The input image sequences are first partitioned into non-overlapping patches. Then, each patch is represented by a vector of coefficients that can linearly reconstruct the patch from a learned dictionary of basis patches. The space-time saliency of patches is measured by Shannon's self-information entropy, where a patch's saliency is determined by information variation in the contents of the patch's spatiotemporal neighborhood. Experimental results on three benchmark datasets demonstrate the effectiveness of the proposed method.

机译：基于局部兴趣点周围特征描述符的方法现已广泛用于动作识别中。特征点是使用多种度量来检测的，即显着性，周期性，运动活动等。这些度量中的每一种通常都是基于强度的，并在密度和信息量之间进行权衡。在本文中，我们通过将图像序列表示为稀疏的补丁程序级时空事件集合来解决动作识别问题，这些事件在时域和时域上都是显着的。我们的方法使用视频的多尺度体积表示，并自适应地选择最佳时空尺度，在该尺度下补丁的显着性最为显着。首先将输入图像序列划分为不重叠的块。然后，每个补丁都由一个系数向量表示，该系数向量可以从学习的基础补丁字典中线性地重建补丁。补丁的时空显着性是通过Shannon的自我信息熵来衡量的，其中补丁的显着性是由补丁时空邻域的内容中的信息变化来确定的。在三个基准数据集上的实验结果证明了该方法的有效性。

著录项

来源
《Multimedia Tools and Applications》 |2017年第10期|12645-12658|共14页
作者
Fu Yinghua; Zhang Tao; Wang Wenjin;
展开▼
作者单位

Shanghai Jiao Tong Univ, Dept Automat, Shanghai, Peoples R China|Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai, Peoples R China;

Shanghai Jiao Tong Univ, Dept Automat, Shanghai, Peoples R China;

Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai, Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Sparse coding; Space-time saliency; Action recognition; Self-information; Shannon entropy;

机译：稀疏编码时空显着性动作识别自我信息香农熵;

相似文献

外文文献
中文文献
专利

1. Sparse coding-based representation of LBP difference for 30/4D facial expression recognition [J] . Bejaoui Hela, Ghazouani Haythem, Barhoumi Walid Multimedia Tools and Applications . 2019,第16期

机译：30 / 4D面部表情识别的基于稀疏编码的LBP差异表示
2. Pattern Recognition special issue: Sparse representation for event recognition in videosurveillance (Editorial) [J] . Zhou H., Zhang J., Wang L., Pattern Recognition: The Journal of the Pattern Recognition Society . 2013,第7期

机译：模式识别特刊：视频监视中事件识别的稀疏表示（编辑）
3. Sparse composition of body poses and atomic actions for human activity recognition in RGB-D videos [J] . Lillo Ivan, Niebles Juan Carlos, Soto Alvaro Image and Vision Computing . 2017,第MARa期

机译：RGB-D视频中人体姿势识别和原子动作的稀疏构成
4. Space-Time Robust Video Representation for Action Recognition [C] . Nicolas Ballas, Betrand Delezoide, Yi Yang, International Conference on Computer Vision . 2013

机译：用于行动识别的时空强大视频表示
5. Robust representation and recognition of actions in video. [D] . Natarajan, Pradeep. 2009

机译：视频中动作的可靠表示和识别。
6. Sparse Representation for Tumor Classification Based on Feature Extraction Using Latent Low-Rank Representation [O] . Bin Gan, Chun-Hou Zheng, Jun Zhang, -1

机译：基于潜在低秩表示的特征提取的肿瘤分类的稀疏表示
7. Action recognition in video by sparse representation on covariance manifolds of silhouette tunnels [O] . Kai Guo, Prakash Ishwar, Janusz Konrad 2010

机译：稀疏表示在轮廓隧道协方差流形上的视频动作识别

Sparse coding-based space-time video representation for action recognition

摘要

著录项

相似文献

相关主题

期刊订阅