首页> 外文会议>International Conference on Artificial Neural Networks >Training Discriminative Models to Evaluate Generative Ones
【24h】

Training Discriminative Models to Evaluate Generative Ones

机译:训练判别模型以评估生成模型

获取原文

摘要

Generative models are known to be difficult to assess. Recent works, especially on generative adversarial networks (GANs), produce good visual samples of varied categories of images. However, the validation of their quality is still difficult to define and there is no existing agreement on the best evaluation process. This paper aims at making a step toward an objective evaluation process for generative models. It presents a new method to assess a trained generative model by evaluating the test accuracy of a classifier trained with generated data. The test set is composed of real images. Therefore, The classifier accuracy is used as a proxy to evaluate if the generative model fit the true data distribution. By comparing results with different generated datasets we are able to classify and compare generative models. The motivation of this approach is also to evaluate if generative models can help discriminative neural networks to learn, i.e., measure if training on generated data is able to make a model successful at testing on real settings. Our experiments compare different generators from the Variational Auto-Encoders (VAE) and Generative Adversarial Network (GAN) frameworks on MNIST and fashion MNIST datasets. Our results show that none of the generative models is able to replace completely true data to train a discriminative model. But they also show that the initial GAN and WGAN are the best choices to generate on MNIST database (Modified National Institute of Standards and Technology database) and fashion MNIST database.
机译:已知生成模型很难评估。最近的工作,尤其是关于生成对抗网络(GAN)的工作,产生了各种图像类别的良好视觉样本。但是,对其质量的验证仍然很难定义,并且关于最佳评估过程尚无共识。本文旨在朝着生成模型的客观评估过程迈出一步。它提出了一种通过评估使用生成的数据训练的分类器的测试准确性来评估训练的生成模型的新方法。测试集由真实图像组成。因此,将分类器准确性用作评估生成模型是否符合真实数据分布的代理。通过将结果与不同的生成数据集进行比较,我们能够对生成的模型进行分类和比较。这种方法的动机还在于评估生成模型是否可以帮助判别性神经网络学习,即衡量对生成数据的训练是否能够使模型在真实环境下成功测试。我们的实验在MNIST和时尚MNIST数据集上比较了来自变分自动编码器(VAE)和生成对抗网络(GAN)框架的不同生成器。我们的结果表明,没有一个生成模型能够取代完全真实的数据来训练判别模型。但是他们也表明,最初的GAN和WGAN是在MNIST数据库(改良的美国国家标准技术研究院数据库)和时尚MNIST数据库上生成的最佳选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号