首页> 外文OA文献 >Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network
【2h】

Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network

机译:使用有效的子像素卷积神经网络的实时单图像和视频超分辨率

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Recently, several models based on deep neural networks have achieved great success in terms of both reconstruction accuracy and computational performance for single image super-resolution. In these methods, the low resolution (LR) input image is upscaled to the high resolution (HR) space using a single filter, commonly bicubic interpolation, before reconstruction. This means that the super-resolution (SR) operation is performed in HR space. We demonstrate that this is sub-optimal and adds computational complexity. In this paper, we present the first convolutional neural network (CNN) capable of real-time SR of 1080p videos on a single K2 GPU. To achieve this, we propose a novel CNN architecture where the feature maps are extracted in the LR space. In addition, we introduce an efficient sub-pixel convolution layer which learns an array of upscaling filters to upscale the final LR feature maps into the HR output. By doing so, we effectively replace the handcrafted bicubic filter in the SR pipeline with more complex upscaling filters specifically trained for each feature map, whilst also reducing the computational complexity of the overall SR operation. We evaluate the proposed approach using images and videos from publicly available datasets and show that it performs significantly better (+0.15dB on Images and +0.39dB on Videos) and is an order of magnitude faster than previous CNN-based methods.
机译:最近,基于深度神经网络的几种模型在重建精度和单图像超分辨率的计算性能方面都取得了巨大成功。在这些方法中,重建前使用单个滤波器(通常是双三次插值)将低分辨率(LR)输入图像放大到高分辨率(HR)空间。这意味着在HR空间中执行超分辨率(SR)操作。我们证明这是次优的,并增加了计算复杂度。在本文中,我们提出了第一个能够在单个K2 GPU上对1080p视频进行实时SR的卷积神经网络(CNN)。为此,我们提出了一种新颖的CNN架构,其中在LR空间中提取了特征图。此外,我们引入了一个有效的子像素卷积层,该层学习一系列的升频滤波器,以将最终的LR特征图升频到HR输出中。这样一来,我们便用更复杂的针对每个功能图训练的更复杂的升频滤波器有效地替换了SR管道中的手工双三次滤波器,同时还降低了整个SR操作的计算复杂性。我们使用公开数据集中的图像和视频评估了提出的方法,结果表明该方法的效果明显更好(图像上+ 0.15dB,视频上+ 0.39dB),并且比以前的基于CNN的方法快一个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号