IEEE Transactions on Circuits and Systems for Video Technology

Motion-Aware Feature Enhancement Network for Video Prediction

Abstract

Video prediction is challenging due to the pixel-level precision requirement and the difficulty of capturing scene dynamics. Most approaches tackle these problems with pixel-level reconstruction objectives and two decomposed branches, yet they still suffer from blurry generations or dramatic degradation in long-term prediction. In this paper, we propose a Motion-Aware Feature Enhancement (MAFE) network for video prediction that produces realistic future frames and achieves relatively long-term predictions. First, a Channel-wise and Spatial Attention (CSA) module is designed to extract motion-aware features; it enhances the contribution of important motion details during encoding and subsequently improves the discriminability of the attention maps used for frame refinement. Second, a Motion Perceptual Loss (MPL) is proposed to guide the learning of temporal cues, which benefits robust long-term video prediction. Extensive experiments on three human activity video datasets, KTH, Human3.6M, and PennAction, demonstrate the effectiveness of the proposed video prediction model compared with state-of-the-art approaches.
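The abstract describes two components without giving their form: a CSA block that reweights features along the channel and spatial axes, and an MPL that supervises temporal cues. A minimal NumPy sketch of both ideas is shown below; it is an illustration, not the paper's implementation. The function names, the use of global-average pooling with a sigmoid for the attention weights, and the choice of encoded frame differences as the "motion" signal for the perceptual loss are all assumptions.

```python
import numpy as np

def _sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_spatial_attention(feat):
    """Hypothetical CSA-style block: channel weights from global average
    pooling, then spatial weights from the channel-wise mean.

    feat: array of shape (C, H, W); returns a reweighted map of the same shape.
    """
    # Channel attention: one weight per channel from its global average.
    chan = _sigmoid(feat.mean(axis=(1, 2)))        # shape (C,)
    feat = feat * chan[:, None, None]
    # Spatial attention: one weight per location from the channel mean.
    spat = _sigmoid(feat.mean(axis=0))             # shape (H, W)
    return feat * spat[None, :, :]

def motion_perceptual_loss(pred, target, encoder):
    """Hypothetical motion perceptual loss: compare encoded frame
    differences (a simple proxy for motion) of the prediction and the
    ground truth in feature space.

    pred, target: arrays of shape (T, C, H, W); encoder: feature extractor.
    """
    motion_pred = encoder(np.diff(pred, axis=0))   # (T-1, C, H, W)
    motion_true = encoder(np.diff(target, axis=0))
    return float(np.mean((motion_pred - motion_true) ** 2))
```

With non-negative inputs, both attention stages multiply by weights in (0, 1), so the block can only attenuate features, letting training emphasize motion-relevant channels and locations; the loss is zero exactly when predicted and true motion features coincide.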
