IEEE International Conference on Image Processing

Spatial-Temporal Data Augmentation Based on LSTM Autoencoder Network for Skeleton-Based Human Action Recognition

Abstract

Data augmentation is known to be of crucial importance for the generalization of RNN-based methods for skeleton-based human action recognition. Traditional data augmentation methods apply various hand-crafted transformations only in the spatial domain and therefore lack an effective temporal representation. This paper extends the traditional Long Short-Term Memory (LSTM) network and presents a novel LSTM autoencoder network (LSTM-AE) for spatial-temporal data augmentation. In the LSTM-AE, the LSTM network preserves the temporal information of skeleton sequences, while the autoencoder architecture automatically removes irrelevant and redundant information. Meanwhile, a regularized cross-entropy loss is defined to guide the LSTM-AE toward more suitable representations of skeleton data. Experimental results on the currently largest NTU RGB+D dataset and the public SmartHome dataset verify that the proposed model outperforms state-of-the-art methods and can easily be integrated with most RNN-based action recognition models.
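To make the idea concrete, below is a minimal PyTorch sketch of an LSTM autoencoder applied to skeleton sequences, paired with a cross-entropy loss regularized by a reconstruction term. The layer sizes, the NTU-style joint count and class count, and the particular weighting of the two loss terms are illustrative assumptions; the paper's exact LSTM-AE architecture and regularized cross-entropy loss are not reproduced here.

```python
# Minimal sketch (assumed architecture): an LSTM encoder-decoder over skeleton
# sequences whose reconstruction serves as an augmented sample, plus a classifier
# head trained with cross-entropy regularized by a reconstruction term.
import torch
import torch.nn as nn


class LSTMAutoencoder(nn.Module):
    def __init__(self, num_joints=25, coords=3, hidden_size=128, num_classes=60):
        super().__init__()
        input_size = num_joints * coords           # flattened skeleton per frame
        self.encoder = nn.LSTM(input_size, hidden_size, batch_first=True)
        self.decoder = nn.LSTM(hidden_size, hidden_size, batch_first=True)
        self.reconstruct = nn.Linear(hidden_size, input_size)
        self.classify = nn.Linear(hidden_size, num_classes)

    def forward(self, x):
        # x: (batch, frames, num_joints * coords)
        latent, _ = self.encoder(x)                # per-frame temporal latent codes
        decoded, _ = self.decoder(latent)
        recon = self.reconstruct(decoded)          # reconstructed (augmented) sequence
        logits = self.classify(latent[:, -1])      # class prediction from last time step
        return recon, logits


def combined_loss(recon, x, logits, labels, alpha=0.1):
    # Cross-entropy regularized by a reconstruction term; alpha is an assumed weight.
    ce = nn.functional.cross_entropy(logits, labels)
    rec = nn.functional.mse_loss(recon, x)
    return ce + alpha * rec


if __name__ == "__main__":
    model = LSTMAutoencoder()
    x = torch.randn(8, 100, 75)                    # 8 clips, 100 frames, 25 joints x 3D
    labels = torch.randint(0, 60, (8,))
    recon, logits = model(x)
    loss = combined_loss(recon, x, logits, labels)
    loss.backward()
    print(recon.shape, logits.shape, loss.item())
```

In this reading, the reconstructed sequences act as spatial-temporal augmentations that can be fed to an existing RNN-based recognizer, while the classification term keeps the learned representations discriminative.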
