First person action recognition is an active research area, driven by the increasing popularity of wearable devices. Action classification for first person video (FPV) is more challenging than conventional action classification due to strong egocentric motion, frequent viewpoint changes, and diverse global motion patterns. To tackle these challenges, we introduce a two-stream convolutional neural network that improves action recognition via long-term fusion pooling operators. The proposed method effectively captures the temporal structure of actions by pooling over sequences of frame-wise appearance and motion features. Our experiments validate the effect of the feature pooling operators and show that the proposed method achieves state-of-the-art performance on standard action datasets.
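As a rough illustration of the idea, the sketch below (an assumption, not the authors' implementation) shows one common family of long-term fusion pooling: frame-wise features from an appearance (RGB) stream and a motion (optical flow) stream are pooled over the temporal axis with max and average operators, then fused for classification. The class name, feature dimensions, and choice of pooling operators are all illustrative.

```python
# Minimal sketch of long-term fusion pooling over frame-wise two-stream
# features; assumes each stream has already produced a per-frame feature
# vector (e.g., from a 2D CNN backbone). Not the paper's exact operators.
import torch
import torch.nn as nn

class LongTermFusionPooling(nn.Module):
    """Pools sequences of frame-wise features into a clip-level descriptor."""

    def __init__(self, feat_dim: int, num_classes: int):
        super().__init__()
        # Two streams, each contributing max- and average-pooled features.
        self.classifier = nn.Linear(4 * feat_dim, num_classes)

    def forward(self, rgb_feats: torch.Tensor, flow_feats: torch.Tensor):
        # rgb_feats, flow_feats: (batch, time, feat_dim) frame-wise features.
        pooled = []
        for feats in (rgb_feats, flow_feats):
            pooled.append(feats.max(dim=1).values)  # temporal max pooling
            pooled.append(feats.mean(dim=1))        # temporal average pooling
        clip_descriptor = torch.cat(pooled, dim=-1)  # (batch, 4 * feat_dim)
        return self.classifier(clip_descriptor)

# Usage with stand-in features: 2 clips, 16 frames, 512-dim features.
model = LongTermFusionPooling(feat_dim=512, num_classes=10)
rgb = torch.randn(2, 16, 512)
flow = torch.randn(2, 16, 512)
logits = model(rgb, flow)  # shape: (2, 10)
```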