首页> 外文期刊>IEEE Transactions on Image Processing >Invertibility-Driven Interpolation Filter for Video Coding
【24h】

Invertibility-Driven Interpolation Filter for Video Coding

机译:用于视频编码的可逆驱动内插滤波器

获取原文
获取原文并翻译 | 示例
           

摘要

Motion compensation with fractional motion vector has been widely utilized in the video coding standards. The fractional samples are usually generated by fractional interpolation filters. Traditional interpolation filters are usually designed based on the signal processing theory with the assumption of band-limited signal, which cannot effectively capture the non-stationary property of video content and cannot adapt to the variety of video quality. In this paper, we reveal an intuitive property of the fractional interpolation problem, named invertibility. That is, the fractional interpolation filters should not only generate fractional samples from integer samples but also recover the integer samples from the fractional samples in an invertible manner. We prove in theory that the invertibility in the spatial domain is equivalent to the constant magnitude in the Fourier transform domain. Driven by the invertibility, we then develop a learning-based method to solve the fractional interpolation problem. Inspired by the advances of convolutional neural network (CNN), we propose to establish an end-to-end scheme using CNN to train invertibility-driven interpolation filter (InvIF). Different from the previous learning-based methods, the proposed training scheme does not need hand-crafted "ground truth" of fractional samples. The proposed InvIF is integrated into high efficiency video coding (HEVC), and extensive experiments are conducted to verify its effectiveness. The experimental results show that the proposed method can achieve on average 4.7% and 3.6% BD-rate reduction compared with the HEVC anchor, under low-delay-B and random-access configurations, respectively.
机译:具有分数运动矢量的运动补偿已在视频编码标准中得到广泛利用。分数样本通常由分数插值滤波器生成。传统的插值滤波器通常是在信号处理理论的基础上,以带宽受限的信号为前提进行设计的,不能有效地捕获视频内容的非平稳性,不能适应视频质量的变化。在本文中,我们揭示了分数插值问题的直观属性,称为可逆性。即,分数内插滤波器不仅应该从整数样本生成分数样本,而且还应该以可逆的方式从分数样本恢复整数样本。我们从理论上证明,空间域中的可逆性等于傅立叶变换域中的恒定幅度。在可逆性的驱动下,我们开发了一种基于学习的方法来解决分数插值问题。受卷积神经网络(CNN)进步的启发,我们建议使用CNN建立端到端方案来训练可逆性驱动的插值滤波器(InvIF)。与以前的基于学习的方法不同,所提出的训练方案不需要手工制作的分数样本的“地面真理”。提出的InvIF已集成到高效视频编码(HEVC)中,并进行了广泛的实验以验证其有效性。实验结果表明,与HEVC锚相比,在低延迟B和随机访问配置下,所提出的方法可以分别平均降低BD速率4.7%和3.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号