IEEE Conference on Computer Vision and Pattern Recognition Workshops

Temporal Domain Neural Encoder for Video Representation Learning



Abstract

We address the challenge of learning good video representations by explicitly modeling the relationships between visual concepts over time. We propose a novel Temporal Preserving Recurrent Neural Network (TPRNN) that takes frame-level features as input and extracts and encodes visual dynamics. The proposed architecture captures temporal dynamics by tracking the ordinal relationships of co-occurring visual concepts, and constructs video representations from their temporal order patterns. The resulting representations effectively encode the temporal information of dynamic patterns, making them more discriminative for human actions that differ only in the order of their action patterns. We evaluate the proposed model on several real-world video datasets, and the results show that it outperforms the baseline models. In particular, we observe significant improvements on action classes that can be distinguished only by capturing the temporal order of action patterns.
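The core property the abstract relies on, a video encoder whose output depends on the order of the frames, can be illustrated with a plain recurrent unit over frame-level features. This is a minimal generic sketch, not the paper's TPRNN; the weight shapes and random features below are made up for the demo:

```python
import numpy as np

def encode_frames(frames, W_h, W_x, b):
    """Fold a (T, d_in) sequence of frame-level features into one
    video vector by returning the final hidden state of a vanilla RNN."""
    h = np.zeros(W_h.shape[0])
    for x in frames:                       # one recurrent step per frame, in order
        h = np.tanh(W_h @ h + W_x @ x + b)
    return h

# Toy weights and frame features; dimensions are arbitrary for the demo.
rng = np.random.default_rng(0)
d_in, d_h, T = 8, 4, 5
W_h = rng.normal(scale=0.5, size=(d_h, d_h))
W_x = rng.normal(scale=0.5, size=(d_h, d_in))
b = np.zeros(d_h)
frames = rng.normal(size=(T, d_in))

v_fwd = encode_frames(frames, W_h, W_x, b)        # original frame order
v_rev = encode_frames(frames[::-1], W_h, W_x, b)  # reversed frame order

# A recurrent encoder separates the two orderings, whereas an
# order-invariant pooling such as a mean over frames cannot.
v_mean_fwd = frames.mean(axis=0)
v_mean_rev = frames[::-1].mean(axis=0)
```

Reversing the frames changes the recurrent encoding but leaves the mean-pooled vector untouched, which is why actions distinguished only by the order of their patterns need a temporally aware encoder.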
