Variational Autoencoder for Deep Learning of Images, Labels and Captions

机译：用于图像，标签和标题深度学习的变体自动编码器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A novel variational autoencoder is developed to model images, as well as associated labels or captions. The Deep Generative Deconvolutional Network (DGDN) is used as a decoder of the latent image features, and a deep Convolutional Neural Network (CNN) is used as an image encoder; the CNN is used to approximate a distribution for the latent DGDN features/code. The latent code is also linked to generative models for labels (Bayesian support vector machine) or captions (recurrent neural network). When predicting a label/caption for a new image at test, averaging is performed across the distribution of latent codes; this is computationally efficient as a consequence of the learned CNN-based encoder. Since the framework is capable of modeling the image in the presence/absence of associated labels/captions, a new semi-supervised setting is manifested for CNN learning with images; the framework even allows unsupervised CNN learning, based on images alone.

机译：开发了新颖的变体自动编码器以对图像以及相关的标签或标题建模。深度生成反卷积网络（DGDN）用作潜像特征的解码器，深度卷积神经网络（CNN）用作图像编码器; CNN用于估计潜在DGDN功能/代码的分布。潜在代码还链接到标签（贝叶斯支持向量机）或标题（递归神经网络）的生成模型。当预测要测试的新图像的标签/标题时，将对潜在代码的分布进行平均。由于学习了基于CNN的编码器，因此计算效率很高。由于该框架能够在存在/不存在相关标签/标题的情况下对图像进行建模，因此为CNN学习图像提供了一种新的半监督设置。该框架甚至允许仅基于图像进行无监督的CNN学习。

著录项

来源
《Annual conference on Neural Information Processing Systems》|2016年|2360-2368|共9页
会议地点
作者
Yunchen Pu; Zhe Gan; Ricardo Henao; Xin Yuan; Chunyuan Li; Andrew Stevens; Lawrence Carint;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Remote sensing image captioning via Variational Autoencoder and Reinforcement Learning [J] . Shen Xiangqing, Liu Bing, Zhou Yong, Knowledge-Based Systems . 2020,第Sepa5期

机译：通过变化自动化器和强化学习遥感图像标题
2. A semi-supervised deep learning image caption model based on Pseudo Label and N-gram [J] . Cheng Cheng, Li Chunping, Han Youfang, International Journal of Approximate Reasoning . 2021,第Apra期

机译：基于伪标签和N-GRAM的半监督深层学习图像标题模型
3. Colorizing and Captioning Images Using Deep Learning Models and Deploying Them Via loT Deployment Tools [J] . Krishnamurthi Rajalakshmi, Maheshwari Raghav, Gulati Rishabh International journal of information retrieval research . 2020,第4期

机译：使用深度学习模型并通过批次部署工具部署它们的彩色和标题图像
4. Variational Autoencoder for Deep Learning of Images, Labels and Captions [C] . Yunchen Pu, Zhe Gan, Ricardo Henao, Annual conference on Neural Information Processing Systems . 2016

机译：变形AutoEncoder，用于深入学习图像，标签和标题
5. Generation of Humorous Caption for Cartoon Images Using Deep Learning [D] . Shanmuga Sundaram, Rajesh. 2018

机译：使用深度学习的卡通形象的幽默标题
6. Prediction of Potential miRNA–Disease Associations Through a Novel Unsupervised Deep Learning Framework with Variational Autoencoder [O] . Li Zhang, Xing Chen, Jun Yin 2019

机译：通过具有变分自动编码器的新型无监督深度学习框架预测潜在的miRNA-疾病关联
7. Variational Autoencoder-Based Multiple Image Captioning Using a Caption Attention Map [O] . Boeun Kim, Saim Shin, Hyedong Jung 2019

机译：使用标题注意图的基于变化的自动统计器的多个图像标题

Variational Autoencoder for Deep Learning of Images, Labels and Captions

摘要

著录项

相似文献

相关主题

期刊订阅