International Conference on Computer Vision

Space-Time Robust Video Representation for Action Recognition

Abstract

We address the problem of action recognition in unconstrained videos. We propose a novel content-driven pooling that leverages space-time context while remaining robust to global space-time transformations. Robustness to such transformations is of primary importance in unconstrained videos, where the localization of the action can shift drastically between frames. Our pooling identifies regions of interest using video structural cues estimated by different saliency functions. To combine the different structural information, we introduce an iterative structure-learning algorithm, WSVM (weighted SVM), that determines the optimal saliency layout of an action model through a sparse regularizer. A new optimization method is proposed to solve WSVM's highly non-smooth objective function. We evaluate our approach on standard action datasets (KTH, UCF50 and HMDB). Most noticeably, the accuracy of our algorithm reaches 51.8% on the challenging HMDB dataset, a relative improvement of 7.3% over the state of the art.
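To make the idea concrete, below is a minimal, illustrative sketch in Python (NumPy only) of a weighted-SVM-style learner in the spirit of the abstract: each video is pooled under several saliency functions, and a per-saliency weight vector is learned jointly with a linear classifier under an L1 penalty so that only a sparse saliency layout survives. The data layout, the alternating subgradient scheme and all names are assumptions made for illustration; the paper's actual WSVM formulation and optimization method are not reproduced here.

# Illustrative sketch only -- not the authors' WSVM implementation.
import numpy as np

def hinge_subgradients(X, y, w, b):
    """Subgradients of the mean hinge loss for a linear scorer X @ w + b."""
    margins = y * (X @ w + b)
    active = margins < 1.0                       # margin-violating samples
    grad_w = -(y[active][:, None] * X[active]).sum(axis=0) / len(y)
    grad_b = -y[active].sum() / len(y)
    return grad_w, grad_b

def train_wsvm(pooled, y, lam_w=1e-2, lam_beta=1e-2, outer=20, inner=50, lr=0.05):
    """pooled: array (n, D, p) -- n videos, D saliency poolings, p-dim descriptors.
    y: labels in {-1, +1}. Returns (w, b, beta)."""
    n, D, p = pooled.shape
    w, b = np.zeros(p), 0.0
    beta = np.ones(D) / D                        # start from a uniform saliency layout
    for _ in range(outer):
        # Step 1: fix beta, fit the linear SVM on the beta-combined descriptors.
        Xc = np.einsum('ndp,d->np', pooled, beta)
        for _ in range(inner):
            gw, gb = hinge_subgradients(Xc, y, w, b)
            w -= lr * (gw + lam_w * w)
            b -= lr * gb
        # Step 2: fix w, update beta by hinge subgradient plus an L1 proximal step.
        S = pooled @ w                           # (n, D): score of each saliency channel
        for _ in range(inner):
            margins = y * (S @ beta + b)
            active = margins < 1.0
            g_beta = -(y[active][:, None] * S[active]).sum(axis=0) / n
            beta -= lr * g_beta
            # soft-thresholding drives unhelpful saliency weights exactly to zero
            beta = np.sign(beta) * np.maximum(np.abs(beta) - lr * lam_beta, 0.0)
    return w, b, beta

# Toy usage: 40 videos, 3 saliency poolings of 16-dim descriptors, binary labels.
rng = np.random.default_rng(0)
pooled = rng.normal(size=(40, 3, 16))
y = np.where(rng.random(40) > 0.5, 1.0, -1.0)
w, b, beta = train_wsvm(pooled, y)
print("learned saliency weights:", np.round(beta, 3))

The soft-thresholding step after each beta update is what produces a sparse set of active saliency channels, mirroring the role the abstract assigns to the sparse regularizer in selecting the optimal saliency layout.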
