IEEE International Conference on Multimedia and Expo

Spatial Mask ConvLSTM Network and Intra-Class Joint Training Method for Human Action Recognition in Video



Abstract

Attention models are widely used for action recognition, but most fail to consider the relationship between spatial and temporal information. We therefore propose a Spatial Mask ConvLSTM Network (SM_ConvLSTM-Net) that determines an attention score for each pixel position. SM_ConvLSTM-Net combines spatial and temporal information to obtain a more precise spatial mask, with a long receptive field in the time domain. Furthermore, to exploit the connections among different samples of the same category, we propose a novel training method, called the intra-class joint training method, which encourages the network to extract the action-related characteristics that samples of the same class share across different backgrounds. Extensive experiments demonstrate the effectiveness of our method, which significantly outperforms the baseline C3D network on UCF101 and HMDB51. Moreover, among state-of-the-art approaches with RGB input, our approach achieves the best performance on UCF101 and a comparable result on HMDB51.
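The abstract does not include implementation details, but the core mechanism it describes, a ConvLSTM whose hidden state is decoded into a per-pixel sigmoid attention mask over each frame's features, can be sketched as follows. This is a minimal NumPy illustration only: the function names (`conv3x3`, `convlstm_mask_step`), the weight tensors `Wg` and `Wm`, and all shapes are assumptions, not the authors' SM_ConvLSTM-Net.

```python
import numpy as np

def conv3x3(x, w):
    # x: (C_in, H, W); w: (C_out, C_in, 3, 3); zero-padded "same" convolution.
    C_out, H, W = w.shape[0], x.shape[1], x.shape[2]
    xp = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    out = np.zeros((C_out, H, W))
    for o in range(C_out):
        for c in range(x.shape[0]):
            for di in range(3):
                for dj in range(3):
                    out[o] += w[o, c, di, dj] * xp[c, di:di + H, dj:dj + W]
    return out

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def convlstm_mask_step(x, h, c, Wg, Wm):
    # Standard ConvLSTM update: Wg maps [x; h] to the four gates,
    # so each gate sees a spatial neighborhood, not a single pixel.
    z = conv3x3(np.concatenate([x, h], axis=0), Wg)          # (4*Ch, H, W)
    Ch = h.shape[0]
    i, f = sigmoid(z[:Ch]), sigmoid(z[Ch:2 * Ch])
    o, g = sigmoid(z[2 * Ch:3 * Ch]), np.tanh(z[3 * Ch:])
    c = f * c + i * g
    h = o * np.tanh(c)
    # Decode the hidden state into a 1-channel spatial mask of
    # per-pixel attention scores in (0, 1).
    mask = sigmoid(conv3x3(h, Wm))                           # (1, H, W)
    return h, c, mask

rng = np.random.default_rng(0)
Cx, Ch, H, W = 8, 4, 16, 16
Wg = rng.normal(0.0, 0.1, (4 * Ch, Cx + Ch, 3, 3))
Wm = rng.normal(0.0, 0.1, (1, Ch, 3, 3))
h = np.zeros((Ch, H, W))
c = np.zeros((Ch, H, W))
for t in range(5):                       # walk a short clip frame by frame
    x = rng.normal(size=(Cx, H, W))
    h, c, mask = convlstm_mask_step(x, h, c, Wg, Wm)
    attended = mask * x                  # re-weight features spatially
```

Because the hidden state carries information across frames, the mask at frame `t` depends on the whole clip seen so far, which is one way to realize the "long receptive field in the time domain" the abstract refers to.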
