首页> 外文会议>Pacific-Rim conference on multimedia >Coupled Learning for Image Generation and Latent Representation Inference Using MMD
【24h】

Coupled Learning for Image Generation and Latent Representation Inference Using MMD

机译:使用MMD进行图像生成和潜在表示推理的耦合学习

获取原文

摘要

For modeling the data distribution or the latent representation distribution in the image domain, deep learning methods such as the variational autoencoder (VAE) and the generative adversarial network (GAN) have been proposed. However, despite its capability of modeling these two distributions, VAE tends to learn less meaningful latent representations; GAN can only model the data distribution using the challenging and unstable adversarial training. To address these issues, we propose an unsupervised learning framework to perform coupled learning of these two distributions based on kernel maximum mean discrepancy (MMD). Specifically, the proposed framework consists of (1) an inference network and a generation network for mapping between the data space and the latent space, and (2) a latent tester and a data tester for performing two-sample tests in these two spaces, respectively. On one hand, we perform a two-sample test between stochastic representations from the prior distribution and inferred representations from the inference network. On the other hand, we perform a two-sample test between the real data and generated data. In addition, we impose structural reg-ularization that the two networks are inverses of each other, so that the learning of these two distributions can be coupled. Experimental results on benchmark image datasets demonstrate that the proposed framework is competitive on image generation and latent representation inference of images compared with representative approaches.
机译:为了对图像域中的数据分布或潜在表示分布进行建模,已经提出了深度学习方法,例如变分自动编码器(VAE)和生成对抗网络(GAN)。但是,尽管能够对这两种分布进行建模,但VAE倾向于学习不太有意义的潜在表示。 GAN只能使用具有挑战性和不稳定的对抗训练来对数据分布进行建模。为了解决这些问题,我们提出了一种无监督的学习框架,可以基于内核最大均值差(MMD)对这两个分布进行耦合学习。具体而言,所提出的框架包括(1)用于在数据空间和潜在空间之间进行映射的推理网络和生成网络,以及(2)用于在这两个空间中执行两次样本测试的潜在测试器和数据测试器,分别。一方面,我们在先验分布的随机表示与推理网络的推理表示之间执行了两个样本的测试。另一方面,我们在真实数据和生成的数据之间执行了两个样本的测试。另外,我们强加结构正则化,即两个网络彼此相反,以便可以耦合这两个分布的学习。在基准图像数据集上的实验结果表明,与代表性方法相比,所提出的框架在图像生成和图像的潜在表示推断方面具有竞争力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号