...
首页> 外文期刊>IEICE transactions on information and systems >GAN-Based Image Compression Using Mutual Information for Optimizing Subjective Image Similarity
【24h】

GAN-Based Image Compression Using Mutual Information for Optimizing Subjective Image Similarity

机译:基于GaN的图像压缩,使用相互信息优化主观图像相似性

获取原文
           

摘要

Recently, image compression systems based on convolutional neural networks that use flexible nonlinear analysis and synthesis transformations have been developed to improve the restoration accuracy of decoded images. Although these methods that use objective metric such as peak signal-to-noise ratio and multi-scale structural similarity for optimization attain high objective results, such metric may not reflect human visual characteristics and thus degrade subjective image quality. A method using a framework called a generative adversarial network (GAN) has been reported as one of the methods aiming to improve the subjective image quality. It optimizes the distribution of restored images to be close to that of natural images; thus it suppresses visual artifacts such as blurring, ringing, and blocking. However, since methods of this type are optimized to focus on whether the restored image is subjectively natural or not, components that are not correlated with the original image are mixed into the restored image during the decoding process. Thus, even though the appearance looks natural, subjective similarity may be degraded. In this paper, we investigated why the conventional GAN-based compression techniques degrade subjective similarity, then tackled this problem by rethinking how to handle image generation in the GAN framework between image sources with different probability distributions. The paper describes a method to maximize mutual information between the coding features and the restored images. Experimental results show that the proposed mutual information amount is clearly correlated with subjective similarity and the method makes it possible to develop image compression systems with high subjective similarity.
机译:最近,已经开发了基于使用灵活的非线性分析和合成变换的卷积神经网络的图像压缩系统,以提高解码图像的恢复精度。虽然这些方法使用客观度量的诸如峰值信噪比和多尺度结构相似性的用于优化获得高目标结果,但是这种度量可能不反映人类的视觉特征,从而降低主观图像质量。使用称为生成的对冲网络(GaN)的框架的方法被报告为旨在提高主观图像质量的方法之一。它优化恢复图像的分布接近自然图像;因此,它抑制了视觉伪像,例如模糊,振铃和阻挡。然而,由于这种类型的方法被优化以专注于恢复的图像是否是主观自然的,因此在解码过程期间,与原始图像不相关的组件将不会被混合到恢复的图像中。因此,即使外观看起来很自然,即使外观很自然,主观相似度可能会降低。在本文中,我们研究了为什么传统的GaN基压缩技术劣化主观相似度,然后通过重新思考如何处理具有不同概率分布的图像源之间的GaN框架中的图像生成来解决这个问题。本文介绍了一种最大化编码特征和恢复图像之间的互信息的方法。实验结果表明,所提出的互信息量与主观相似度明显相关,并且该方法使得可以开发具有高主观相似性的图像压缩系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号