Spatio-Temporal Fusion Networks for Action Recognition

机译：时空融合网络的动作识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The video based CNN works have focused on effective ways to fuse appearance and motion networks, but they typically lack utilizing temporal information over video frames. In this work, we present a novel spatio-temporal fusion network (STFN) that integrates temporal dynamics of appearance and motion information from entire videos. The captured temporal dynamic information is then aggregated for a better video level representation and learned via end-to-end training. The spatio-temporal fusion network consists of two set of Residual Inception blocks that extract temporal dynamics and a fusion connection for appearance and motion features. The benefits of STFN are: (a) it captures local and global temporal dynamics of complementary data to learn video-wide information; and (b) it is applicable to any network for video classification to boost performance. We explore a variety of design choices for STFN and verify how the network performance is varied with the ablation studies. We perform experiments on two challenging human activity datasets, UCF101 and HMDB51, and achieve the state-of-the-art results with the best network.

机译：基于视频的CNN工作集中在融合外观和运动网络的有效方法上，但是它们通常缺乏在视频帧上利用时间信息的能力。在这项工作中，我们提出了一种新颖的时空融合网络（STFN），该网络整合了来自整个视频的外观和运动信息的时间动态。然后，将捕获的时间动态信息进行汇总，以获得更好的视频级别表示，并通过端到端训练进行学习。时空融合网络由两套残余初始块组成，它们提取了时间动态特性以及用于外观和运动特征的融合连接。 STFN的好处是：（a）它捕获补充数据的局部和全局时间动态，以学习视频范围的信息; （b）适用于任何视频分类网络以提高性能。我们探索STFN的多种设计选择，并通过消融研究验证网络性能如何变化。我们在两个具有挑战性的人类活动数据集UCF101和HMDB51上进行了实验，并以最佳的网络获得了最新的结果。

著录项

来源
《Asian Conference on Computer Vision》|2018年|347-364|共18页
会议地点
作者
Sangwoo Cho; Hassan Foroosh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Action recognition; Spatio-temporal fusion; Temporal dynamics;

机译：动作识别;时空融合;时间动态;

相似文献

外文文献
中文文献
专利

1. Probabilistic Reasoning for Unique Role Recognition Based on the Fusion of Semantic-Interaction and Spatio-Temporal Features [J] . Yang Chule, Yue Yufeng, Zhang Jun, IEEE transactions on multimedia . 2019,第5期

机译：基于语义交互和时空特征融合的唯一角色识别的概率推理
2. Probabilistic Reasoning for Unique Role Recognition Based on the Fusion of Semantic-Interaction and Spatio-Temporal Features [J] . Yang Chule, Yue Yufeng, Zhang Jun, IEEE transactions on multimedia . 2019,第5期

机译：基于语义交互和时空特征融合的独特作用识别的概率推理
3. Action recognition with spatio-temporal augmented descriptor and fusion method (vol 76, pg 13953, 2017) [J] . Li Lijun, Dai Shuling Multimedia Tools and Applications . 2017,第12期

机译：时空增强描述符和融合方法的动作识别（vol 76，pg 13953，2017）
4. Spatio-Temporal Fusion Networks for Action Recognition [C] . Sangwoo Cho, Hassan Foroosh Asian Conference on Computer Vision . 2019

机译：用于行动识别的时空融合网络
5. Neural networks for vision and pattern recognition: Boundary completion, spatial mapping, and multidimensional data fusion. [D] . Lesher, Gregory W. 1994

机译：用于视觉和模式识别的神经网络：边界完成，空间映射和多维数据融合。
6. Whole and Part Adaptive Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition [O] . Qi Zuo, Lian Zou, Cien Fan, 2020

机译：基于骨架的动作识别的整个和部分自适应融合图卷积网络
7. Human Action Recognition in Videos using Convolution Long Short-Term Memory Network with Spatio-Temporal Networks [O] . Ashok Sarabu, Ajit Kumar Santra 2021

机译：使用带有时空网络的卷积长短短期内存网络的视频中的人为行动认可
8. Garbled Text String Recognition with a Spatio-Temporal Pattern Recognition NeuralNetwork (Verminkte Tekst String Herkenning met een Spatio-Temporeel Patroon Herkennings Neuraal Netwerk) [R] . Meiler, P. P. 1990

机译：带有时空模式识别的乱码文本字符串识别NeuralNetwork（Verminkte Tekst字符串Herkenning遇见了enen spatio-Temporeel patroon Herkennings Neuraal Netwerk）

Spatio-Temporal Fusion Networks for Action Recognition

摘要

著录项

相似文献

相关主题

期刊订阅