首页> 外文会议>2018 IEEE International Work Conference on Bioinspired Intelligence >Analysis of Content-Aware Image Compression with VGG16
【24h】

Analysis of Content-Aware Image Compression with VGG16

机译:使用VGG16分析内容感知的图像压缩

获取原文
获取原文并翻译 | 示例

摘要

Content-aware compression based on the use of saliency maps aims to improve the interpretability of an image by encoding the more relevant image regions with a higher quality than the rest of the image. This paper revisits two convolutional neural network (CNN) models based on VGG16, multi-structure region of interest (MS-ROI) and class activation map (CAM), which enable the localization of salient image regions. While the MS-ROI model allows for the localization of multiple salient image regions, the CAM model, on the other hand, tends to localize only the most relevant class. We use the contextual information provided by the obtained saliency maps to guide the compression. By encoding more important image regions at a higher bitrate and less important ones at a lower bitrate, different qualities of compression for the regions of interest and the background are obtained, while also achieving smooth transitions from salient to non-salient regions. The performance of both models is evaluated on images from the MIT Saliency Benchmark dataset and the General-100 dataset, and the results of the compression are compared to the standard JPEG compression at different quality factors. Experimental results show that for the files of approximately same size, the compression methods based on the two CNN models outperform the standard JPEG compression. When comparing the compression based on the MS-ROI model to the compression based on the CAM model, the former is characterized by a higher PSNR and a better visual quality of the obtained images.
机译:基于显着性图的基于内容的压缩旨在通过以比图像其余部分更高的质量编码更相关的图像区域来提高图像的可解释性。本文回顾了基于VGG16的两个卷积神经网络(CNN)模型,多结构感兴趣区域(MS-ROI)和类激活图(CAM),它们可以对显着图像区域进行定位。虽然MS-ROI模型可以对多个显着图像区域进行定位,但是CAM模型却倾向于仅对最相关的类进行定位。我们使用获得的显着性图提供的上下文信息来指导压缩。通过以较高的比特率编码较重要的图像区域,以较低的比特率编码较不重要的图像区域,可以获得感兴趣区域和背景的不同压缩质量,同时还实现了从显着区域到非显着区域的平滑过渡。在MIT Saliency Benchmark数据集和General-100数据集的图像上评估了这两种模型的性能,并将压缩结果与不同质量因子下的标准JPEG压缩进行了比较。实验结果表明,对于大约相同大小的文件,基于两个CNN模型的压缩方法优于标准JPEG压缩。当将基于MS-ROI模型的压缩与基于CAM模型的压缩进行比较时,前者的特点是获得的图像具有更高的PSNR和更好的视觉质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号