Spatio-Temporal Fusion Networks for Action Recognition

机译：用于行动识别的时空融合网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The video based CNN works have focused on effective ways to fuse appearance and motion networks, but they typically lack utilizing temporal information over video frames. In this work, we present a novel spatio-temporal fusion network (STFN) that integrates temporal dynamics of appearance and motion information from entire videos. The captured temporal dynamic information is then aggregated for a better video level representation and learned via end-to-end training. The spatio-temporal fusion network consists of two set of Residual Inception blocks that extract temporal dynamics and a fusion connection for appearance and motion features. The benefits of STFN are: (a) it captures local and global temporal dynamics of complementary data to learn video-wide information; and (b) it is applicable to any network for video classification to boost performance. We explore a variety of design choices for STFN and verify how the network performance is varied with the ablation studies. We perform experiments on two challenging human activity datasets, UCF101 and HMDB51, and achieve the state-of-the-art results with the best network.

机译：基于视频的CNN工程专注于熔断器外观和运动网络的有效方法，而是通常缺少在视频帧上使用时间信息。在这项工作中，我们提出了一种新颖的时空融合网络（STFN），它集成了整个视频的外观和运动信息的时间动态。然后聚合捕获的时间动态信息以获得更好的视频级表示，并通过端到端培训学习。时空融合网络由两组残差块组成，提取时间动态和用于外观和运动功能的融合连接。 STFN的好处是：（a）它捕获了互补数据的本地和全局时间动态，以学习视频范围信息; （b）适用于任何用于促进性能的视频分类网络。我们探索STFN的各种设计选择，并验证网络性能如何随着消融研究而变化。我们对两个具有挑战性的人类活动数据集，UCF101和HMDB51进行实验，并通过最佳网络实现最先进的结果。

著录项

来源
《Asian Conference on Computer Vision》|2019年|xx 722 p.|共18页
会议地点
作者
Sangwoo Cho; Hassan Foroosh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类理论、方法;
关键词
Action recognition; Spatio-temporal fusion; Temporal dynamics;

机译：行动识别;时空融合;时间动态;

相似文献

外文文献
中文文献
专利

1. Probabilistic Reasoning for Unique Role Recognition Based on the Fusion of Semantic-Interaction and Spatio-Temporal Features [J] . Yang Chule, Yue Yufeng, Zhang Jun, IEEE transactions on multimedia . 2019,第5期

机译：基于语义交互和时空特征融合的唯一角色识别的概率推理
2. Probabilistic Reasoning for Unique Role Recognition Based on the Fusion of Semantic-Interaction and Spatio-Temporal Features [J] . Yang Chule, Yue Yufeng, Zhang Jun, IEEE transactions on multimedia . 2019,第5期

机译：基于语义交互和时空特征融合的独特作用识别的概率推理
3. Action recognition with spatio-temporal augmented descriptor and fusion method (vol 76, pg 13953, 2017) [J] . Li Lijun, Dai Shuling Multimedia Tools and Applications . 2017,第12期

机译：时空增强描述符和融合方法的动作识别（vol 76，pg 13953，2017）
4. Spatio-Temporal Fusion Networks for Action Recognition [C] . Sangwoo Cho, Hassan Foroosh Asian Conference on Computer Vision . 2019

机译：用于行动识别的时空融合网络
5. Neural networks for vision and pattern recognition: Boundary completion, spatial mapping, and multidimensional data fusion. [D] . Lesher, Gregory W. 1994

机译：用于视觉和模式识别的神经网络：边界完成，空间映射和多维数据融合。
6. Whole and Part Adaptive Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition [O] . Qi Zuo, Lian Zou, Cien Fan, 2020

机译：基于骨架的动作识别的整个和部分自适应融合图卷积网络
7. Human Action Recognition in Videos using Convolution Long Short-Term Memory Network with Spatio-Temporal Networks [O] . Ashok Sarabu, Ajit Kumar Santra 2021

机译：使用带有时空网络的卷积长短短期内存网络的视频中的人为行动认可
8. Garbled Text String Recognition with a Spatio-Temporal Pattern Recognition NeuralNetwork (Verminkte Tekst String Herkenning met een Spatio-Temporeel Patroon Herkennings Neuraal Netwerk) [R] . Meiler, P. P. 1990

机译：带有时空模式识别的乱码文本字符串识别NeuralNetwork（Verminkte Tekst字符串Herkenning遇见了enen spatio-Temporeel patroon Herkennings Neuraal Netwerk）

Spatio-Temporal Fusion Networks for Action Recognition

摘要

著录项

相似文献

相关主题

期刊订阅