IEEE Transactions on Image Processing

View-Invariant Deep Architecture for Human Action Recognition Using Two-Stream Motion and Shape Temporal Dynamics

Abstract

Human action recognition from unknown views is a challenging task. We propose a deep view-invariant human action recognition framework, which is a novel integration of two important action cues: motion and shape temporal dynamics (STD). The motion stream encapsulates the motion content of an action as RGB Dynamic Images (RGB-DIs), which are generated by Approximate Rank Pooling (ARP) and processed with a fine-tuned InceptionV3 model. The STD stream learns long-term view-invariant shape dynamics of the action using a sequence of LSTM and Bi-LSTM learning models. A Human Pose Model (HPM) generates view-invariant features from structural similarity index (SSIM)-based key depth human pose frames. The final prediction of the action is made by three types of late fusion, i.e. maximum (max), average (avg) and multiply (mul), applied to the individual stream scores. To validate the performance of the proposed framework, experiments are performed under both cross-subject and cross-view validation schemes on three publicly available benchmarks: the NUCLA multi-view dataset, the UWA3D-II Activity dataset and the NTU RGB-D Activity dataset. Our algorithm significantly outperforms existing state-of-the-art methods, measured in terms of recognition accuracy, receiver operating characteristic (ROC) curves and area under the curve (AUC).
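To make the motion-stream input concrete, the following is a minimal sketch of Approximate Rank Pooling using the closed-form frame weights of Bilen et al.'s dynamic-image formulation; it assumes a clip given as an array of RGB frames and rescales the result to an 8-bit image, which may differ from the paper's exact pre-processing.

```python
import numpy as np

def approximate_rank_pooling(frames):
    """Collapse a clip of RGB frames, shape (T, H, W, 3), into a single
    RGB Dynamic Image via approximate rank pooling: each frame is weighted
    by a closed-form coefficient and the weighted frames are summed."""
    frames = np.asarray(frames, dtype=np.float64)
    T = frames.shape[0]
    # Harmonic numbers H_0 .. H_T, with H_0 = 0 and H_t = sum_{i=1}^{t} 1/i.
    H = np.concatenate(([0.0], np.cumsum(1.0 / np.arange(1, T + 1))))
    # ARP weight for the t-th frame (1-indexed):
    # alpha_t = 2(T - t + 1) - (T + 1)(H_T - H_{t-1})
    t = np.arange(1, T + 1)
    alpha = 2.0 * (T - t + 1) - (T + 1) * (H[T] - H[t - 1])
    di = np.tensordot(alpha, frames, axes=1)  # (H, W, 3) weighted sum of frames
    # Rescale to [0, 255] so the dynamic image can feed a CNN such as InceptionV3.
    di = 255.0 * (di - di.min()) / (di.max() - di.min() + 1e-8)
    return di.astype(np.uint8)
```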
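A similarly small sketch of the three late-fusion rules applied to the two streams' class scores is given below; the function and argument names are illustrative assumptions, taking each stream's output to be a per-class softmax vector.

```python
import numpy as np

def late_fusion(motion_scores, std_scores, mode="mul"):
    """Fuse per-class scores from the motion stream and the STD stream
    with one of three rules (max, avg, mul) and return the predicted
    class index together with the fused score vector."""
    s1 = np.asarray(motion_scores, dtype=np.float64)
    s2 = np.asarray(std_scores, dtype=np.float64)
    if mode == "max":
        fused = np.maximum(s1, s2)
    elif mode == "avg":
        fused = (s1 + s2) / 2.0
    elif mode == "mul":
        fused = s1 * s2
    else:
        raise ValueError(f"unknown fusion mode: {mode}")
    return int(np.argmax(fused)), fused
```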
