...
首页> 外文期刊>Computer vision and image understanding >Action recognition using global spatio-temporal features derived from sparse representations
【24h】

Action recognition using global spatio-temporal features derived from sparse representations

机译:使用基于稀疏表示的全局时空特征进行动作识别

获取原文
获取原文并翻译 | 示例
           

摘要

Recognizing actions is one of the important challenges in computer vision with respect to video data, with applications to surveillance, diagnostics of mental disorders, and video retrieval. Compared to other data modalities such as documents and images, processing video data demands orders of magnitude higher computational and storage resources. One way to alleviate this difficulty is to focus the computations to informative (salient) regions of the video. In this paper, we propose a novel global spatio-temporal self-similarity measure to score saliency using the ideas of dictionary learning and sparse coding. In contrast to existing methods that use local spatio-temporal feature detectors along with descriptors (such as HOG, HOG3D, and HOF), dictionary learning helps consider the saliency in a global setting (on the entire video) in a computationally efficient way. We consider only a small percentage of the most salient (least self-similar) regions found using our algorithm, over which spatio-temporal descriptors such as HOG and region covariance descriptors are computed. The ensemble of such block descriptors in a bag-of-features framework provides a holistic description of the motion sequence which can be used in a classification setting. Experiments on several benchmark datasets in video based action classification demonstrate that our approach performs competitively to the state of the art.
机译:识别动作是计算机视觉中有关视频数据的重要挑战之一,并将其​​应用于监视,精神障碍诊断和视频检索。与其他数据形式(例如文档和图像)相比,处理视频数据需要更高数量级的计算和存储资源。减轻此困难的一种方法是将计算重点放在视频的信息(显着)区域。在本文中,我们提出了一种新颖的全局时空自相似性度量,以利用字典学习和稀疏编码的思想对显着性评分。与使用局部时空特征检测器以及描述符(例如HOG,HOG3D和HOF)的现有方法相比,字典学习有助于以计算有效的方式考虑全局设置(在整个视频中)的显着性。我们仅考虑使用我们的算法发现的最显着(最小自相似)区域的一小部分,在这些区域上可以计算时空描述符(例如HOG和区域协方差描述符)。功能包框架中此类块描述符的集合提供了可在分类设置中使用的运动序列的整体描述。在基于视频的动作分类中对几个基准数据集进行的实验表明,我们的方法在技术水平上具有竞争力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号