IEEE Transactions on Circuits and Systems for Video Technology

Deep Network-Based Frame Extrapolation With Reference Frame Alignment



Abstract

Frame extrapolation predicts future frames from past (reference) frames; it has been studied intensively in computer vision research and has great potential in video coding. Recently, a number of studies have applied deep networks to frame extrapolation, with some success. However, due to the complex and diverse motion patterns in natural video, it remains difficult to extrapolate high-fidelity frames directly from reference frames. To address this problem, we introduce reference frame alignment as a key technique for deep network-based frame extrapolation. We propose to align the reference frames, e.g. using block-based motion estimation and motion compensation, and then to extrapolate from the aligned frames with a trained deep network. Since the alignment, as a preprocessing step, effectively reduces the diversity of the network input, we observe that the network is easier to train and the extrapolated frames are of higher quality. We verify the proposed technique in video coding, using the extrapolated frame for inter prediction in High Efficiency Video Coding (HEVC) and Versatile Video Coding (VVC). We investigate different schemes, including whether to align the target frame with the reference frames, and whether to perform motion estimation on the extrapolated frame. We conduct a comprehensive set of experiments to study the efficiency of the proposed method and to compare the different schemes. Experimental results show that our proposal achieves on average 5.3% and 2.8% BD-rate reduction in the Y component compared to HEVC, under the low-delay P and low-delay B configurations, respectively. Our proposal performs much better than frame extrapolation without reference frame alignment.
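To make the alignment step concrete, the following is a minimal sketch of block-based motion estimation and motion compensation used to warp a past reference frame toward the most recent frame, so that the aligned frames can be stacked as input to an extrapolation network. This is not the paper's implementation: the exhaustive SAD search, the block and search-range sizes, and all function names are illustrative, and the deep network itself is omitted.

```python
import numpy as np

def block_motion_search(ref, cur, block=8, search=4):
    """Exhaustive block matching: for each block of `cur`, find the
    best-matching block in `ref` within a +/-`search` window (SAD cost).
    Returns a (H/block, W/block, 2) field of (dy, dx) motion vectors."""
    H, W = cur.shape
    mv = np.zeros((H // block, W // block, 2), dtype=int)
    for by in range(H // block):
        for bx in range(W // block):
            y, x = by * block, bx * block
            tgt = cur[y:y + block, x:x + block].astype(int)
            best, best_mv = np.inf, (0, 0)
            for dy in range(-search, search + 1):
                for dx in range(-search, search + 1):
                    yy, xx = y + dy, x + dx
                    if yy < 0 or xx < 0 or yy + block > H or xx + block > W:
                        continue  # candidate block falls outside the frame
                    cand = ref[yy:yy + block, xx:xx + block].astype(int)
                    cost = np.abs(cand - tgt).sum()  # sum of absolute differences
                    if cost < best:
                        best, best_mv = cost, (dy, dx)
            mv[by, bx] = best_mv
    return mv

def motion_compensate(ref, mv, block=8):
    """Warp `ref` toward the current frame using the block motion field,
    producing an aligned reference frame for the extrapolation network."""
    H, W = ref.shape
    out = np.zeros_like(ref)
    for by in range(mv.shape[0]):
        for bx in range(mv.shape[1]):
            y, x = by * block, bx * block
            dy, dx = mv[by, bx]
            out[y:y + block, x:x + block] = ref[y + dy:y + dy + block,
                                                x + dx:x + dx + block]
    return out

# Usage: align each past frame to the newest one, then feed the stack
# (aligned references + newest frame) to a trained extrapolation network.
rng = np.random.default_rng(0)
ref = rng.integers(0, 256, (16, 16)).astype(np.uint8)
cur = np.roll(ref, (2, 3), axis=(0, 1))  # simulate pure translational motion
mv = block_motion_search(ref, cur)
aligned = motion_compensate(ref, mv)     # interior now matches `cur`
```

In a coding context this reuses machinery the codec already has (block-based motion estimation and compensation), which is why it is attractive as a preprocessing step: the network only has to model the residual motion left after alignment.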
