首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Plug Play Generative Networks: Conditional Iterative Generation of Images in Latent Space
【24h】

Plug Play Generative Networks: Conditional Iterative Generation of Images in Latent Space

机译:即插即用的生成网络:潜在空间中图像的条件迭代生成

获取原文

摘要

Generating high-resolution, photo-realistic images has been a long-standing goal in machine learning. Recently, Nguyen et al. [37] showed one interesting way to synthesize novel images by performing gradient ascent in the latent space of a generator network to maximize the activations of one or multiple neurons in a separate classifier network. In this paper we extend this method by introducing an additional prior on the latent code, improving both sample quality and sample diversity, leading to a state-of-the-art generative model that produces high quality images at higher resolutions (227 × 227) than previous generative models, and does so for all 1000 ImageNet categories. In addition, we provide a unified probabilistic interpretation of related activation maximization methods and call the general class of models Plug and Play Generative Networks. PPGNs are composed of 1) a generator network G that is capable of drawing a wide range of image types and 2) a replaceable condition network C that tells the generator what to draw. We demonstrate the generation of images conditioned on a class (when C is an ImageNet or MIT Places classification network) and also conditioned on a caption (when C is an image captioning network). Our method also improves the state of the art of Multifaceted Feature Visualization [40], which generates the set of synthetic inputs that activate a neuron in order to better understand how deep neural networks operate. Finally, we show that our model performs reasonably well at the task of image inpainting. While image models are used in this paper, the approach is modality-agnostic and can be applied to many types of data.
机译:生成高分辨率,逼真的图像一直是机器学习的长期目标。最近,Nguyen等人。 [37]展示了一种有趣的方式来合成新颖的图像,方法是在生成器网络的潜在空间中执行梯度上升,以最大化单独分类器网络中一个或多个神经元的激活。在本文中,我们通过在潜在代码上引入一个额外的先验来扩展该方法,从而改善了样本质量和样本多样性,从而建立了一个最新的生成模型,该模型可以生成高分辨率(227×227)的高质量图像比以前的生成模型要好,并且对所有1000种ImageNet类别都适用。此外,我们提供了有关激活最大化方法的统一概率解释,并将模型的一般类称为即插即用生成网络。 PPGN由1)能够绘制各种图像类型的生成器网络G和2)告知生成器绘制内容的可替换条件网络C组成。我们演示了以类为条件的图像的生成(当C为ImageNet或MIT Places分类网络时),并且还以字幕为条件(当C为图像字幕网络时)。我们的方法还改善了多面特征可视化技术的水平[40],它生成了一组激活神经元的合成输入,以便更好地了解深度神经网络的运行方式。最后,我们证明了我们的模型在图像修复任务上表现良好。尽管本文使用图像模型,但该方法是模态不可知的,可以应用于多种类型的数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号