Venue: International Symposium on Communications and Information Technologies

Unsupervised learning of space-time symmetric patterns in RGB-D videos for 4D human activity detection



Abstract

In this paper, we present an approach for finding a space-time activity map in a video shot using 3D moment methods. An RGB-D video involving a specific human activity is first regularly partitioned into multiple video shots in which human activities can be defined. Each video shot is further separated into multiple video cubes that characterize local object shape and motion. Given a local video cube, the proposed space-time pattern detector extracts both spatial and temporal symmetric information, which is then grouped by hashing to construct an activity map describing the distribution of object motion vectors in the video shot. The intrinsic human activity in a video consisting of multiple shots is thus represented by a set of activity maps. Next, to reduce the temporal dimensionality of an activity represented by activity maps, kernel PCA is applied to transform the activity representation into a set of principal activity maps. Finally, regardless of the activity types of the training videos, all training principal activity maps are clustered into multiple clusters to generate a principal activity map dictionary. This dictionary is used to solve the initial pose problem when dynamic programming is applied to align two sequences of principal activity maps for recognizing human activities in RGB-D videos. The proposed approach was tested on publicly available datasets. Experimental results demonstrate the good performance of the proposed method in terms of activity detection accuracy and execution speed.
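Two numeric steps in the pipeline above can be sketched concretely: the kernel PCA projection that turns a sequence of activity maps into principal activity maps, and the dynamic-programming alignment of two such sequences. The sketch below is a minimal, hypothetical illustration only: it assumes each activity map is flattened into a row vector, uses an RBF kernel, and implements alignment as standard dynamic time warping; the function names, kernel choice, and distance measure are assumptions, not the authors' exact formulation.

```python
import numpy as np

def kernel_pca(X, n_components, gamma=1.0):
    """Project rows of X (flattened activity maps) onto kernel principal components."""
    # RBF (Gaussian) kernel matrix between all pairs of activity maps.
    sq = np.sum(X ** 2, axis=1)
    K = np.exp(-gamma * (sq[:, None] + sq[None, :] - 2.0 * X @ X.T))
    # Center the kernel matrix in feature space.
    n = K.shape[0]
    one_n = np.ones((n, n)) / n
    Kc = K - one_n @ K - K @ one_n + one_n @ K @ one_n
    # Eigendecomposition of the symmetric centered kernel; keep top components.
    vals, vecs = np.linalg.eigh(Kc)
    order = np.argsort(vals)[::-1][:n_components]
    vals, vecs = vals[order], vecs[:, order]
    # Projections of the training points onto the principal components.
    return Kc @ (vecs / np.sqrt(np.maximum(vals, 1e-12)))

def dtw_distance(A, B):
    """Dynamic-programming (DTW) alignment cost between two sequences
    of principal activity maps, each given as a list of vectors."""
    n, m = len(A), len(B)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = np.linalg.norm(A[i - 1] - B[j - 1])  # local map distance
            D[i, j] = cost + min(D[i - 1, j],      # skip a frame in A
                                 D[i, j - 1],      # skip a frame in B
                                 D[i - 1, j - 1])  # match both frames
    return D[n, m]
```

In this reading, the principal activity map dictionary would supply candidate starting maps so the alignment is not penalized by an unknown initial pose; the DTW cost could then serve as the matching score between a query video and each training activity.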
