首页> 外文会议>IEEE International Conference on Image Processing >Analysis Of Neural Image Compression Networks For Machine-To-Machine Communication
【24h】

Analysis Of Neural Image Compression Networks For Machine-To-Machine Communication

机译:用于机器到机器通信的神经图像压缩网络分析

获取原文

摘要

Video and image coding for machines (VCM) is an emerging field that aims to develop compression methods resulting in optimal bitstreams when the decoded frames are analyzed by a neural network. Several approaches already exist improving classic hybrid codecs for this task. However, neural compression networks (NCNs) have made an enormous progress in coding images over the last years. Thus, it is reasonable to consider such NCNs, when the information sink at the decoder side is a neural network as well. Therefore, we build-up an evaluation framework analyzing the performance of four state-of-the-art NCNs, when a Mask R-CNN is segmenting objects from the decoded image. The compression performance is measured by the weighted average precision for the Cityscapes dataset. Based on that analysis, we find that networks with leaky ReLU as non-linearity and training with SSIM as distortion criteria results in the highest coding gains for the VCM task. Furthermore, it is shown that the GAN-based NCN architecture achieves the best coding performance and even out-performs the recently standardized Versatile Video Coding (VVC) for the given scenario.
机译:用于机器(VCM)的视频和图像编码是一种新兴领域,其旨在开发压缩方法,当通过神经网络分析解码的帧时产生最佳比特流。已经存在了几种方法,改善了此任务的经典混合编解码器。然而,神经压缩网络(NCNS)在过去几年中对编码图像进行了巨大进展。因此,当解码器侧的信息接收器也是神经网络时,需要考虑这样的NCN是合理的。因此,我们建立一个评估框架,分析四个最先进的NCNS的性能,当掩模R-CNN是来自解码图像的分段对象时。压缩性能是通过CityScapes数据集的加权平均精度来衡量的。基于该分析,我们发现具有泄漏的网络作为与SSIM的非线性和培训,作为失真标准导致VCM任务的最高编码增益。此外,示出了基于GaN的NCN架构实现了最佳编码性能,甚至为给定方案执行最近标准化的多功能视频编码(VVC)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号