JMLR: Workshop and Conference Proceedings

Avoiding Latent Variable Collapse with Generative Skip Models

Abstract

Variational autoencoders (VAEs) learn distributions of high-dimensional data. They model data with a deep latent-variable model and then fit the model by maximizing a lower bound of the log marginal likelihood. VAEs can capture complex distributions, but they can also suffer from an issue known as "latent variable collapse," especially if the likelihood model is powerful. Specifically, the lower bound involves an approximate posterior of the latent variables; this posterior "collapses" when it is set equal to the prior, i.e., when the approximate posterior is independent of the data. While VAEs learn good generative models, latent variable collapse prevents them from learning useful representations. In this paper, we propose a simple new way to avoid latent variable collapse by including skip connections in our generative model; these connections enforce strong links between the latent variables and the likelihood function. We study generative skip models both theoretically and empirically. Theoretically, we prove that skip models increase the mutual information between the observations and the inferred latent variables. Empirically, we study images (MNIST and Omniglot) and text (Yahoo). Compared to existing VAE architectures, we show that generative skip models maintain similar predictive performance but lead to less collapse and provide more meaningful representations of the data.
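For readers who want the quantities behind the abstract: the bound being maximized is the evidence lower bound (ELBO), and "latent variable collapse" is the regime where the approximate posterior ignores the data. A short sketch in standard VAE notation (the symbols below are generic, not taken from the paper itself):

```latex
% ELBO for a latent-variable model p_\theta(x, z) with approximate posterior q_\phi(z|x)
\mathcal{L}(\theta,\phi;x)
  = \mathbb{E}_{q_\phi(z\mid x)}\!\left[\log p_\theta(x\mid z)\right]
  - \mathrm{KL}\!\left(q_\phi(z\mid x)\,\|\,p(z)\right)
  \;\le\; \log p_\theta(x).

% Collapse: the approximate posterior matches the prior and becomes independent of x,
% so the KL term vanishes and the latent code carries (almost) no information about the data.
q_\phi(z\mid x) \approx p(z)
  \quad\Longrightarrow\quad
  \mathrm{KL}\!\left(q_\phi(z\mid x)\,\|\,p(z)\right) \approx 0.
```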
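The fix the abstract proposes, skip connections in the generative model, can be pictured with a small decoder whose every layer receives the latent code directly. Below is a minimal PyTorch-style sketch under assumed layer sizes and a Bernoulli pixel likelihood (e.g. binarized MNIST); it illustrates the idea and is not the authors' released implementation.

```python
import torch
import torch.nn as nn


class SkipDecoder(nn.Module):
    """Decoder in which the latent code z is concatenated into every layer."""

    def __init__(self, latent_dim=32, hidden_dim=256, data_dim=784):
        super().__init__()
        self.fc1 = nn.Linear(latent_dim, hidden_dim)
        # Later layers see the previous hidden state *and* z (the skip connection),
        # so the likelihood stays directly dependent on the latent variables.
        self.fc2 = nn.Linear(hidden_dim + latent_dim, hidden_dim)
        self.fc3 = nn.Linear(hidden_dim + latent_dim, data_dim)

    def forward(self, z):
        h = torch.relu(self.fc1(z))
        h = torch.relu(self.fc2(torch.cat([h, z], dim=-1)))   # skip connection
        return self.fc3(torch.cat([h, z], dim=-1))            # Bernoulli logits


# Usage: decode a batch of latent samples drawn from the standard normal prior.
z = torch.randn(8, 32)
logits = SkipDecoder()(z)
print(logits.shape)  # torch.Size([8, 784])
```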
