首页> 外文会议>IEEE International Conference on Multimedia and Expo >Temporal Segment Convolutional Kernel Networks for Sequence Modeling of Videos
【24h】

Temporal Segment Convolutional Kernel Networks for Sequence Modeling of Videos

机译:用于视频序列建模的时间段卷积内核网络

获取原文

摘要

Sequence modeling is crucial for video action recognition. In this paper, we propose temporal segment convolutional kernel networks (TS-CKN), where we take advantage of convolutional neural networks to facilitate the extraction of appearance features, while time sequence is modeled with deep kernel networks. We employ the kernel methods to capture time-varying information of videos and propose a training method for kernel map approximation by matrix backpropagation. This leads to the model named deep kernel networks which can be easily integrated with existing deep learning models such as Resnet. Our approach also samples several video clips sparsely in the video and unifies class predictions from all clips. More importantly, all parameters of our model can be learned by stochastic optimization in an end-to-end manner. We evaluate our method on two standard action recognition datasets including HMDB-51 and UCF-101, achieving the state-of-the-art results.
机译:序列建模对于视频动作识别至关重要。在本文中,我们提出了时间段卷积核网络(TS-CKN),其中我们利用卷积神经网络来促进外观特征的提取,同时使用深核网络对时间序列进行建模。我们采用核方法来捕获视频的时变信息,并提出了一种通过矩阵反向传播进行核图逼近的训练方法。这导致了名为深度内核网络的模型,该模型可以轻松地与现有的深度学习模型(例如Resnet)集成。我们的方法还会在视频中稀疏地采样几个视频剪辑,并统一所有剪辑的班级预测。更重要的是,我们模型的所有参数都可以通过端到端的随机优化来学习。我们在包括HMDB-51和UCF-101在内的两个标准动作识别数据集上评估了我们的方法,从而获得了最新的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号