
Wake-Sleep Variational Autoencoders for Language Modeling

Abstract

Variational Autoencoders (VAEs) are known to suffer easily from the KL-vanishing problem when combined with powerful autoregressive models such as recurrent neural networks (RNNs), which prohibits their wide application in natural language processing. In this paper, we tackle this problem by splitting the training procedure into two steps: learning effective mechanisms to encode and decode discrete tokens (wake step), and generalizing meaningful latent variables by reconstructing dreamed encodings (sleep step). The training pattern is similar to the wake-sleep algorithm: the two steps are trained alternately until an equilibrium is reached. We test our model on a language modeling task. The results demonstrate a significant improvement over current state-of-the-art latent variable models.
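The page reproduces only the abstract, so the following is a speculative PyTorch sketch of the alternating wake/sleep training pattern it describes, not the paper's actual method: the GRU encoder/decoder architecture, the ELBO objective in the wake step, and the latent-recovery (MSE) objective in the sleep step are all illustrative assumptions.

# Speculative sketch of the alternating training pattern from the abstract.
# Wake = fit encoder/decoder on real tokens (standard ELBO); sleep = decode
# "dreamed" latents sampled from the prior and train the encoder to recover
# them. All design choices here are assumptions, not taken from the paper.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TextVAE(nn.Module):
    def __init__(self, vocab=1000, emb=64, hid=128, z_dim=16):
        super().__init__()
        self.embed = nn.Embedding(vocab, emb)
        self.encoder = nn.GRU(emb, hid, batch_first=True)
        self.to_mu = nn.Linear(hid, z_dim)
        self.to_logvar = nn.Linear(hid, z_dim)
        self.z_to_h = nn.Linear(z_dim, hid)
        self.decoder = nn.GRU(emb, hid, batch_first=True)
        self.out = nn.Linear(hid, vocab)

    def encode(self, tokens):
        _, h = self.encoder(self.embed(tokens))
        return self.to_mu(h[-1]), self.to_logvar(h[-1])

def wake_step(model, tokens, opt):
    # Wake: learn effective mechanisms to encode and decode real tokens.
    mu, logvar = model.encode(tokens)
    z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()  # reparameterize
    h0 = torch.tanh(model.z_to_h(z)).unsqueeze(0)
    dec, _ = model.decoder(model.embed(tokens[:, :-1]), h0)
    logits = model.out(dec)
    rec = F.cross_entropy(logits.reshape(-1, logits.size(-1)),
                          tokens[:, 1:].reshape(-1))
    kl = -0.5 * (1 + logvar - mu.pow(2) - logvar.exp()).sum(-1).mean()
    loss = rec + kl
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()

def sleep_step(model, opt, bsz=32, max_len=20, z_dim=16, bos=1):
    # Sleep: "dream" encodings from the prior, decode them into token
    # sequences, then train the encoder to reconstruct the dreamed code.
    with torch.no_grad():
        z = torch.randn(bsz, z_dim)
        h = torch.tanh(model.z_to_h(z)).unsqueeze(0)
        tok = torch.full((bsz, 1), bos, dtype=torch.long)
        dreamed = [tok]
        for _ in range(max_len - 1):
            out, h = model.decoder(model.embed(tok), h)
            tok = torch.multinomial(
                F.softmax(model.out(out[:, -1]), dim=-1), 1)
            dreamed.append(tok)
        dreamed = torch.cat(dreamed, dim=1)
    mu, _ = model.encode(dreamed)
    loss = F.mse_loss(mu, z)  # recover the dreamed latent variable
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()

# Alternate the two steps until training stabilizes (toy data shown).
model = TextVAE()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
batch = torch.randint(2, 1000, (32, 20))  # fake token ids
for step in range(100):
    wake_step(model, batch, opt)
    sleep_step(model, opt)

Because the dreamed sequences are generated without gradients, the sleep loss updates only the encoder, which is one plausible way to keep the latent variable informative and avoid KL vanishing; the paper may realize this differently.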
