首页> 外文会议>Annual conference on Neural Information Processing Systems >Variational Autoencoder for Deep Learning of Images, Labels and Captions
【24h】

Variational Autoencoder for Deep Learning of Images, Labels and Captions

机译:用于图像,标签和标题深度学习的变体自动编码器

获取原文

摘要

A novel variational autoencoder is developed to model images, as well as associated labels or captions. The Deep Generative Deconvolutional Network (DGDN) is used as a decoder of the latent image features, and a deep Convolutional Neural Network (CNN) is used as an image encoder; the CNN is used to approximate a distribution for the latent DGDN features/code. The latent code is also linked to generative models for labels (Bayesian support vector machine) or captions (recurrent neural network). When predicting a label/caption for a new image at test, averaging is performed across the distribution of latent codes; this is computationally efficient as a consequence of the learned CNN-based encoder. Since the framework is capable of modeling the image in the presence/absence of associated labels/captions, a new semi-supervised setting is manifested for CNN learning with images; the framework even allows unsupervised CNN learning, based on images alone.
机译:开发了新颖的变体自动编码器以对图像以及相关的标签或标题建模。深度生成反卷积网络(DGDN)用作潜像特征的解码器,深度卷积神经网络(CNN)用作图像编码器; CNN用于估计潜在DGDN功能/代码的分布。潜在代码还链接到标签(贝叶斯支持向量机)或标题(递归神经网络)的生成模型。当预测要测试的新图像的标签/标题时,将对潜在代码的分布进行平均。由于学习了基于CNN的编码器,因此计算效率很高。由于该框架能够在存在/不存在相关标签/标题的情况下对图像进行建模,因此为CNN学习图像提供了一种新的半监督设置。该框架甚至允许仅基于图像进行无监督的CNN学习。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号