首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Learning structure of stereoscopic image for no-reference quality assessment with convolutional neural network
【24h】

Learning structure of stereoscopic image for no-reference quality assessment with convolutional neural network

机译:卷积神经网络用于无参考质量评估的立体图像学习结构

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we propose to learn the structures of stereoscopic image based on convolutional neural network (CNN) for no-reference quality assessment. Taking image patches from the stereoscopic images as inputs, the proposed CNN can learn the local structures which are sensitive to human perception and representative for perceptual quality evaluation. By stacking multiple convolution and max-pooling layers together, the learned structures in lower convolution layers can be composed and convolved to higher levels to form a fixed-length representation. Multilayer perceptron (MLP) is further employed to summarize the learned representation to a final value to indicate the perceptual quality of the stereo image patch pair. With different inputs, two different CNNs are designed, namely one-column CNN with only the image patch from the difference image as input, and three-column CNN with the image patches from left-view image, right-view image, and difference image as the input. The CNN parameters for stereoscopic images are learned and transferred based on the large number of 2D natural images. With the evaluation on public LIVE phase-I, LIVE phase-II, and IVC stereoscopic image databases, the proposed no-reference metric achieves the state-of-the-art performance for quality assessment of stereoscopic images, and is even competitive to existing full-reference quality metrics. (C) 2016 Elsevier Ltd. All rights reserved.
机译:本文提出基于卷积神经网络(CNN)的无参考质量评估学习立体图像的结构。以立体图像中的图像块为输入,提出的CNN可以学习对人类感知敏感并代表感知质量评估的局部结构。通过将多个卷积层和max-pooling层堆叠在一起,可以在较低的卷积层中组合学习的结构并将其卷积到更高的级别以形成固定长度的表示形式。进一步使用多层感知器(MLP)将学习到的表示概括为一个最终值,以指示立体图像补丁对的感知质量。在输入不同的情况下,设计了两个不同的CNN,即仅将来自差异图像的图像块作为输入的一列CNN,以及具有来自左视图像,右视图图像和差分图像的图像块的三列CNN。作为输入。基于大量的2D自然图像来学习和传输用于立体图像的CNN参数。通过对公共LIVE第一阶段,LIVE阶段II和IVC立体图像数据库进行评估,提出的无参考指标可实现对立体图像质量评估的最新性能,甚至比现有的具有竞争力全参考质量指标。 (C)2016 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号