STRNet:Triple-stream Spatiotemporal Relation Network for Action Recognition

Zhi-Wei Xu; Xiao-Jun Wu; Josef Kittler

首页> 中文期刊> 《国际自动化与计算杂志》 >STRNet:Triple-stream Spatiotemporal Relation Network for Action Recognition

STRNet:Triple-stream Spatiotemporal Relation Network for Action Recognition

AI论文写作 >>

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Learning comprehensive spatiotemporal features is crucial for human action recognition. Existing methods tend to model the spatiotemporal feature blocks in an integrate-separate-integrate form, such as appearance-and-relation network(ARTNet) and spatiotemporal and motion network(STM). However, with blocks stacking up, the rear part of the network has poor interpretability. To avoid this problem, we propose a novel architecture called spatial temporal relation network(STRNet), which can learn explicit information of appearance, motion and especially the temporal relation information. Specifically, our STRNet is constructed by three branches,which separates the features into 1) appearance pathway, to obtain spatial semantics, 2) motion pathway, to reinforce the spatiotemporal feature representation, and 3) relation pathway, to focus on capturing temporal relation details of successive frames and to explore long-term representation dependency. In addition, our STRNet does not just simply merge the multi-branch information, but we apply a flexible and effective strategy to fuse the complementary information from multiple pathways. We evaluate our network on four major action recognition benchmarks: Kinetics-400, UCF-101, HMDB-51, and Something-Something v1, demonstrating that the performance of our STRNet achieves the state-of-the-art result on the UCF-101 and HMDB-51 datasets, as well as a comparable accuracy with the state-of-the-art method on Something-Something v1 and Kinetics-400.

著录项

来源
《国际自动化与计算杂志》 |2021年第5期|718-730|共13页
作者
Zhi-Wei Xu; Xiao-Jun Wu; Josef Kittler;
展开▼
作者单位

School of Artificial Intelligence and Computer Science;

Jiangnan University;

Wuxi 214122;

China;

Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence;

Wuxi 214122;

China;

Centre for Vision;

Speech and Signal Processing;

University of Surrey;

Guildford;

GU27XH;

UK;

展开▼
原文格式 PDF
正文语种 chi
中图分类人工神经网络与计算;
关键词
Action recognition; spatiotemporal relation; multi-branch fusion; long-term representation; video classification;

相似文献

中文文献
外文文献
专利

1. MSF-Net: A Multilevel Spatiotemporal Feature Fusion Network Combines Attention for Action Recognition [J] . Mengmeng Yan ,Chuang Zhang ,Jinqi Chu . 计算机系统科学与工程(英文) . 2023,第11期
2. Fuzzy Empowered Cognitive Spatial Relation Identification and Semantic Action Recognition [J] . R. I. Minu ,G. Nagarajan . 电路与系统(英文) . 2016,第8期
3. Using BlazePose on Spatial Temporal Graph Convolutional Networks for Action Recognition [J] . Motasem S.Alsawadi ,El-Sayed M.El-kenawy ,Miguel Rio . 计算机、材料和连续体(英文) . 2023,第1期
4. A Novel Action Transformer Network for Hybrid Multimodal Sign Language Recognition [J] . Sameena Javaid ,Safdar Rizvi . 计算机、材料和连续体(英文) . 2023,第1期
5. Fine-Grained Action Recognition Based on Temporal Pyramid Excitation Network [J] . Xuan Zhou ,Jianping Yi . 智能自动化与软计算(英文) . 2023,第8期
6. Diamond Body Defect Recognition System Based on Fuzzy Neural Network [C] . Hongyan HUA . 第三届国际信息技术与管理科学学术研讨会 . 2011
7. Human Action Recognition Using 3D--Convolution Neural Networks [A] . Abdul Majid . 2019

STRNet:Triple-stream Spatiotemporal Relation Network for Action Recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅