首页> 外文会议>Annual conference on Neural Information Processing Systems >Variational Autoencoder for Deep Learning of Images, Labels and Captions
【24h】

Variational Autoencoder for Deep Learning of Images, Labels and Captions

机译:变形AutoEncoder,用于深入学习图像,标签和标题

获取原文

摘要

A novel variational autoencoder is developed to model images, as well as associated labels or captions. The Deep Generative Deconvolutional Network (DGDN) is used as a decoder of the latent image features, and a deep Convolutional Neural Network (CNN) is used as an image encoder; the CNN is used to approximate a distribution for the latent DGDN features/code. The latent code is also linked to generative models for labels (Bayesian support vector machine) or captions (recurrent neural network). When predicting a label/caption for a new image at test, averaging is performed across the distribution of latent codes; this is computationally efficient as a consequence of the learned CNN-based encoder. Since the framework is capable of modeling the image in the presence/absence of associated labels/captions, a new semi-supervised setting is manifested for CNN learning with images; the framework even allows unsupervised CNN learning, based on images alone.
机译:开发了一种新颖的变化性AutoEncoder以模拟图像,以及相关的标签或标题。深生成的解卷积网络(DGDN)用作潜像特征的解码器,深卷积神经网络(CNN)用作图像编码器; CNN用于近似潜在DGDN特征/代码的分布。潜在代码也与标签(贝叶斯支持向量机)或标题(经常性神经网络)的生成模型相关联。当在测试时预测用于新图像的标签/标题时,在潜在码的分布中执行平均;这是根据学习的基于CNN的编码器的结果计算的。由于框架能够在相关标签/标题的存在/不存在中建模图像,因此具有图像的CNN学习的新的半监督设置;该框架甚至允许根据单独的图像允许无监督的CNN学习。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号