GAN-Based Image Compression Using Mutual Information for Optimizing Subjective Image Similarity

Shinobu KUDO; Shota ORIHASHI; Ryuichi TANIDA; Seishi TAKAMURA; Hideaki KIMATA

首页> 外文期刊>IEICE transactions on information and systems >GAN-Based Image Compression Using Mutual Information for Optimizing Subjective Image Similarity

【24h】

GAN-Based Image Compression Using Mutual Information for Optimizing Subjective Image Similarity

机译：基于GaN的图像压缩，使用相互信息优化主观图像相似性

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, image compression systems based on convolutional neural networks that use flexible nonlinear analysis and synthesis transformations have been developed to improve the restoration accuracy of decoded images. Although these methods that use objective metric such as peak signal-to-noise ratio and multi-scale structural similarity for optimization attain high objective results, such metric may not reflect human visual characteristics and thus degrade subjective image quality. A method using a framework called a generative adversarial network (GAN) has been reported as one of the methods aiming to improve the subjective image quality. It optimizes the distribution of restored images to be close to that of natural images; thus it suppresses visual artifacts such as blurring, ringing, and blocking. However, since methods of this type are optimized to focus on whether the restored image is subjectively natural or not, components that are not correlated with the original image are mixed into the restored image during the decoding process. Thus, even though the appearance looks natural, subjective similarity may be degraded. In this paper, we investigated why the conventional GAN-based compression techniques degrade subjective similarity, then tackled this problem by rethinking how to handle image generation in the GAN framework between image sources with different probability distributions. The paper describes a method to maximize mutual information between the coding features and the restored images. Experimental results show that the proposed mutual information amount is clearly correlated with subjective similarity and the method makes it possible to develop image compression systems with high subjective similarity.

机译：最近，已经开发了基于使用灵活的非线性分析和合成变换的卷积神经网络的图像压缩系统，以提高解码图像的恢复精度。虽然这些方法使用客观度量的诸如峰值信噪比和多尺度结构相似性的用于优化获得高目标结果，但是这种度量可能不反映人类的视觉特征，从而降低主观图像质量。使用称为生成的对冲网络（GaN）的框架的方法被报告为旨在提高主观图像质量的方法之一。它优化恢复图像的分布接近自然图像;因此，它抑制了视觉伪像，例如模糊，振铃和阻挡。然而，由于这种类型的方法被优化以专注于恢复的图像是否是主观自然的，因此在解码过程期间，与原始图像不相关的组件将不会被混合到恢复的图像中。因此，即使外观看起来很自然，即使外观很自然，主观相似度可能会降低。在本文中，我们研究了为什么传统的GaN基压缩技术劣化主观相似度，然后通过重新思考如何处理具有不同概率分布的图像源之间的GaN框架中的图像生成来解决这个问题。本文介绍了一种最大化编码特征和恢复图像之间的互信息的方法。实验结果表明，所提出的互信息量与主观相似度明显相关，并且该方法使得可以开发具有高主观相似性的图像压缩系统。

著录项

来源
《IEICE transactions on information and systems》 |2021年第3期|共11页
作者
Shinobu KUDO; Shota ORIHASHI; Ryuichi TANIDA; Seishi TAKAMURA; Hideaki KIMATA;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
image compressionCNNGANmutual information;

机译：图像压缩线控信息;

相似文献

外文文献
中文文献
专利

1. Overview on subjective similarity of images for content-based medical image retrieval [J] . Chisako Muramatsu Radiological physics and technology . 2018,第2期

机译：基于内容的医学图像检索图像的主观相似性概述
2. Grey is the new RGB: How good is GAN-based image colorization for image compression? [J] . Aroosh Fatima, Wajahat Hussain, Shahzad Rasool Multimedia Tools and Applications . 2021,第3期

机译：灰色是新的RGB：GaN的图像颜色有多好图像压缩？
3. Simplified Digital Still Image Compression Using Histograms of Wavelet Expansion Coefficients and Subjective Fidelity Estimation between Similar Reconstructed Images [J] . M. IIZUKA, M. KITAMURA, Y. NAKASHIMA Journal of Light & Visual Environment . 1996,第2期

机译：使用小波展开系数直方图和相似重构图像之间的主观保真度估计的简化数字静止图像压缩
4. Scene Retrieval Using Text-to-image GAN-based Visual Similarities and Image-to-text Model-based Textual Similarities [C] . Rintaro Yanagi, Ren Togo, Takahiro Ogawa, IEEE Global Conference on Consumer Electronics . 2019

机译：使用基于文本到图像GAN的视觉相似度和基于图像到文本模型的文本相似度进行场景检索
5. Structural similarity based image quality assessment: Pooling strategies and applications to image compression and digit recognition. [D] . Shang, Xinli. 2006

机译：基于结构相似性的图像质量评估：合并策略及其在图像压缩和数字识别中的应用。
6. Subjective Vividness of Kinesthetic Motor Imagery Is Associated With the Similarity in Magnitude of Sensorimotor Event-Related Desynchronization Between Motor Execution and Motor Imagery [O] . Hisato Toriyama, Junichi Ushiba, Junichi Ushiyama 2018

机译：运动运动图像的主观生动性与运动执行和运动图像之间与感觉运动相关的失步幅度相似性相关
7. GAN-Based Image Compression Using Mutual Information for Optimizing Subjective Image Similarity [O] . Shinobu KUDO, Shota ORIHASHI, Ryuichi TANIDA, 2021

机译：基于GaN的图像压缩，使用相互信息优化主观图像相似性

GAN-Based Image Compression Using Mutual Information for Optimizing Subjective Image Similarity

摘要

著录项

相似文献

相关主题

期刊订阅