Temporal Segment Convolutional Kernel Networks for Sequence Modeling of Videos

机译：用于视频序列建模的时间段卷积内核网络

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sequence modeling is crucial for video action recognition. In this paper, we propose temporal segment convolutional kernel networks (TS-CKN), where we take advantage of convolutional neural networks to facilitate the extraction of appearance features, while time sequence is modeled with deep kernel networks. We employ the kernel methods to capture time-varying information of videos and propose a training method for kernel map approximation by matrix backpropagation. This leads to the model named deep kernel networks which can be easily integrated with existing deep learning models such as Resnet. Our approach also samples several video clips sparsely in the video and unifies class predictions from all clips. More importantly, all parameters of our model can be learned by stochastic optimization in an end-to-end manner. We evaluate our method on two standard action recognition datasets including HMDB-51 and UCF-101, achieving the state-of-the-art results.

机译：序列建模对于视频动作识别至关重要。在本文中，我们提出了时间段卷积核网络（TS-CKN），其中我们利用卷积神经网络来促进外观特征的提取，同时使用深核网络对时间序列进行建模。我们采用核方法来捕获视频的时变信息，并提出了一种通过矩阵反向传播进行核图逼近的训练方法。这导致了名为深度内核网络的模型，该模型可以轻松地与现有的深度学习模型（例如Resnet）集成。我们的方法还会在视频中稀疏地采样几个视频剪辑，并统一所有剪辑的班级预测。更重要的是，我们模型的所有参数都可以通过端到端的随机优化来学习。我们在包括HMDB-51和UCF-101在内的两个标准动作识别数据集上评估了我们的方法，从而获得了最新的结果。

著录项

来源
《IEEE International Conference on Multimedia and Expo》|2019年|1642-1647|共6页
会议地点
作者
Fei Pan; Yanwen Guo; Zhicheng Yan; Jie Guo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Videos; Kernel; Feature extraction; Optical imaging; Modeling; Neural networks; Training;

机译：视频;内核;特征提取;光学成像;建模;神经网络;培训;

相似文献

外文文献
中文文献
专利

1. Biological sequence modeling with convolutional kernel networks [J] . Chen Dexiong, Jacob Laurent, Mairal Julien Bioinformatics . 2019,第18期

机译：卷积核网络的生物序列建模
2. Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos [J] . IEEE transactions on multimedia . 2020,第3期

机译：深度多核卷积LSTM网络和基于注意力的视频机制
3. A deep learning model integrating convolution neural network and multiple kernel K means clustering for segmenting brain tumor in magnetic resonance images [J] . Ragupathy Balakumaresan, Karunakaran Manivannan International journal of imaging systems and technology . 2021,第1期

机译：集成卷积神经网络的深度学习模型和多核K表示磁共振图像中分段脑肿瘤的聚类
4. TEMPORAL SEGMENT CONVOLUTIONAL KERNEL NETWORKS FOR SEQUENCE MODELING OF VIDEOS [C] . Fei Pan, Yanwen Guo, Zhicheng Yan, IEEE International Conference on Multimedia and Expo . 2019

机译：视频序列卷积核网络用于视频序列建模
5. Structure modifiable adaptive reason-building temporal Bayesian Network (SmartBN): Theory and application in human activity and three-dimensional vehicle modeling from video [D] . Ghosh, Nirmalya 2007

机译：结构可修改的自适应原因建立时间贝叶斯网络（SmartBN）：理论和在人类活动和视频中的三维车辆建模中的应用
6. Automatic Change Detection System over Unmanned Aerial Vehicle Video Sequences Based on Convolutional Neural Networks [O] . Víctor García Rubio, Juan Antonio Rodrigo Ferrán, Jose Manuel Menéndez García, 2019

机译：基于卷积神经网络的无人机视频序列自动变化检测系统
7. CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos [O] . Shou, Zheng, Chan, Jonathan, Zareian, Alireza, 2017

机译：CDC：用于精确时间行动的卷积 - 反卷积网络未修剪视频中的本地化

Temporal Segment Convolutional Kernel Networks for Sequence Modeling of Videos

摘要

著录项

相似文献

相关主题

期刊订阅