首页> 外文期刊>Neurocomputing >A recurrent video quality enhancement framework with multi-granularity frame-fusion and frame difference based attention
【24h】

A recurrent video quality enhancement framework with multi-granularity frame-fusion and frame difference based attention

机译:具有多粒度帧融合和基于帧差异的经常性视频质量增强框架

获取原文
获取原文并翻译 | 示例
       

摘要

In recent years, deep learning has attracted substantial research attention for video restoration. Among the existing contributions, the single-frame based approaches purely rely on one reference frame and neglect the rest neighboring frames when enhancing a target frame. By contrast, the multi-frame based contributions exploit temporal information in a sliding window and the existing recurrent design only employ a single preceding enhanced frame. It is intuitive to exploit both multiple original neighboring frames and the preceding enhanced frames for video quality enhancement. In this paper, we propose a Recurrent video quality Enhancement framework with Multi-granularity frame-fusion and frame Difference based attention (REMD). Firstly, we devise a three-dimensional convolutional neural network based encoder-decoder fusion model, which fuses multiple frames in multi-granularity. Secondly, severe compression artifacts tend to emerge on the edges and textures of the compressed frames. We propose a frame difference based spatial attention method to intensify the edges and textures of motioning regions. Finally, a recurrent sliding window design is conceived for exploiting the temporal information in preceding enhanced frames and subsequent neighboring frames. Experiments demonstrate that our method achieves superior performance in comparison to the state-of-the-art contributions with substantially reduced spatial and computational complexity. (C) 2020 Elsevier B.V. All rights reserved.
机译:近年来,深入学习吸引了对视频恢复的大量研究。在现有贡献中,基于单帧的方法纯粹依赖于一个参考帧,并且在增强目标帧时忽略其余相邻帧。相比之下,基于多帧的贡献利用滑动窗口中的时间信息,并且现有的复发设计仅采用单个前面的增强帧。利用多个原始相邻帧和前面的增强帧进行视频质量增强是直观的。在本文中,我们提出了一种具有多粒度帧融合和基于帧差异的经常性视频质量增强框架(REMD)。首先,我们设计了一种三维卷积神经网络的基于编码器解码器融合模型,其在多粒度中熔化多个帧。其次,严重的压缩伪像倾向于出现在压缩框架的边缘和纹理上。我们提出了一种基于帧差的空间注意方法,以加强运动区域的边缘和纹理。最后,构思反复滑动窗设计,用于利用前面增强帧和随后的相邻帧中的时间信息。实验表明,与最先进的空间和计算复杂性的最先进的贡献相比,我们的方法实现了卓越的性能。 (c)2020 Elsevier B.v.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2021年第28期|34-46|共13页
  • 作者单位

    Shenzhen Univ Natl Engn Lab Big Data Syst Comp Technol Sch Comp Sci & Software Engn Shenzhen 518060 Peoples R China|Shenzhen Univ Res Inst Future Media Comp Sch Comp Sci & Software Engn Shenzhen 518060 Peoples R China;

    Shenzhen Univ Natl Engn Lab Big Data Syst Comp Technol Sch Comp Sci & Software Engn Shenzhen 518060 Peoples R China|Shenzhen Univ Res Inst Future Media Comp Sch Comp Sci & Software Engn Shenzhen 518060 Peoples R China;

    Beijing Univ Posts & Telecommun Sch Informat Engn Beijing 100876 Peoples R China|Minist Educ Lab Univ Wireless Commun Beijing 100876 Peoples R China;

    Shenzhen Univ Natl Engn Lab Big Data Syst Comp Technol Sch Comp Sci & Software Engn Shenzhen 518060 Peoples R China|Shenzhen Univ Res Inst Future Media Comp Sch Comp Sci & Software Engn Shenzhen 518060 Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Quality enhancement; Attention map; Video restoration; Recurrent design; 3D-CNN;

    机译:质量增强;注意地图;视频恢复;经常性设计;3D-CNN;
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号