首页> 外文期刊>Journal of vision >Action Encoding and Recognition based on Multi-Scale Spatial-Temporal Natural Action Structures
【24h】

Action Encoding and Recognition based on Multi-Scale Spatial-Temporal Natural Action Structures

机译:基于多尺度时空自然动作结构的动作编码与识别

获取原文
       

摘要

Abstract The visual systems of human and animals respond to a range of actions very quickly. This fast and efficient process includes action detection, recognition, and classification. Extensive studies on action recognition have been performed in the areas of machine learning and computer vision. A key issue is to seek efficient encoding units of natural actions. Current global encoding schemes depend heavily on video segmentation while local encoding schemes lack descriptive power. In this work, natural action structures (NAS) were proposed. NAS are multi-size, multi-scale, spatial-temporal concatenations of local features and function as the basic encoding units of actions. Our approach included patch sampling, independent component analysis, Gabor fitting and clustering, feature space mapping, and NAS constructing. Two improvements over an earlier model were made in the approach. First, in the process of sampling a large number of sequences of circular patches at multiple spatial-temporal scales, a machine learning approach was developed to select interest points based on spatial-temporal features. Second, another machine learning approach with cross-validation was developed to select informative NAS for each action. The performance this NAS-based model of action recognition on several widely used datasets was better than that of the start-of-the-art models, including a biologically motivated system. In conclusion, the proposed NAS are a set of good encoding units of natural actions and the NAS-based action recognition scheme provides important insights to natural action understanding. Key Words: Action recognition, Natural Action Structures, Action encoding.
机译:摘要人和动物的视觉系统对一系列动作的反应非常快。这个快速而有效的过程包括动作检测,识别和分类。在机器学习和计算机视觉领域已经对动作识别进行了广泛的研究。一个关键问题是寻求自然动作的有效编码单位。当前的全局编码方案严重依赖于视频分割,而本地编码方案缺乏描述能力。在这项工作中,提出了自然行动结构(NAS)。 NAS是局部特征的多尺度,多尺度,时空串联,并作为动作的基本编码单位。我们的方法包括补丁采样,独立组件分析,Gabor拟合和聚类,特征空间映射以及NAS构造。该方法对早期模型进行了两项改进。首先,在以多个时空尺度对大量圆形补丁序列进行采样的过程中,人们开发了一种机器学习方法来基于时空特征选择兴趣点。其次,开发了另一种具有交叉验证的机器学习方法,以便为每个操作选择信息丰富的NAS。这种基于NAS的动作识别模型在多个广泛使用的数据集上的性能要优于包括生物动力系统在内的最新模型。总之,提出的NAS是自然动作的良好编码单元集,基于NAS的动作识别方案为自然动作理解提供了重要的见识。关键词:动作识别,自然动作结构,动作编码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号