JMLR: Workshop and Conference Proceedings

Improved Variational Autoencoders for Text Modeling using Dilated Convolutions

Abstract

Recent work on generative text modeling has found that variational autoencoders (VAE) with LSTM decoders perform worse than simpler LSTM language models (Bowman et al., 2015). This negative result is so far poorly understood, but has been attributed to the propensity of LSTM decoders to ignore conditioning information from the encoder. In this paper, we experiment with a new type of decoder for VAE: a dilated CNN. By changing the decoder’s dilation architecture, we control the size of context from previously generated words. In experiments, we find that there is a trade-off between contextual capacity of the decoder and effective use of encoding information. We show that when carefully managed, VAEs can outperform LSTM language models. We demonstrate perplexity gains on two datasets, representing the first positive language modeling result with VAE. Further, we conduct an in-depth investigation of the use of VAE (with our new decoding architecture) for semi-supervised and unsupervised labeling tasks, demonstrating gains over several strong baselines.
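To make the decoding architecture concrete, below is a minimal sketch in PyTorch of a dilated causal CNN decoder conditioned on a VAE latent vector z. This is not the authors' implementation: the vocabulary size, channel width, kernel size, and the dilation schedule (1, 2, 4, 8) are all illustrative assumptions. With kernel size 3, this schedule gives each output position a receptive field of 31 previous tokens; shrinking or extending the schedule is the knob the abstract describes for trading the decoder's contextual capacity against its reliance on the encoding.

import torch
import torch.nn as nn
import torch.nn.functional as F

class DilatedCNNDecoder(nn.Module):
    """Causal dilated CNN decoder for a text VAE (illustrative sketch)."""

    def __init__(self, vocab_size=10000, embed_dim=256, latent_dim=32,
                 channels=256, kernel_size=3, dilations=(1, 2, 4, 8)):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        in_dim = embed_dim + latent_dim  # word embedding + latent z at each step
        self.pads, convs = [], []
        for d in dilations:
            # Left-pad by (kernel_size - 1) * dilation so each output position
            # sees only the current and earlier tokens (causality).
            self.pads.append((kernel_size - 1) * d)
            convs.append(nn.Conv1d(in_dim, channels, kernel_size, dilation=d))
            in_dim = channels
        self.convs = nn.ModuleList(convs)
        self.out = nn.Linear(channels, vocab_size)

    def forward(self, tokens, z):
        # tokens: (batch, seq_len) ids of previously generated words
        # z:      (batch, latent_dim) sample from the approximate posterior
        x = self.embed(tokens)                           # (B, T, E)
        zs = z.unsqueeze(1).expand(-1, x.size(1), -1)    # broadcast z over time
        x = torch.cat([x, zs], dim=-1).transpose(1, 2)   # (B, E+L, T) for Conv1d
        for pad, conv in zip(self.pads, self.convs):
            x = torch.relu(conv(F.pad(x, (pad, 0))))     # causal left padding
        return self.out(x.transpose(1, 2))               # (B, T, vocab) logits

# Example: 4 sequences of 20 tokens conditioned on random latents.
decoder = DilatedCNNDecoder()
logits = decoder(torch.randint(0, 10000, (4, 20)), torch.randn(4, 32))  # (4, 20, 10000)

Two design points in the sketch: the left-only padding before each convolution keeps the model causal, so each position conditions only on earlier words, and concatenating z to the embedding at every timestep, rather than only at initialization as in a typical LSTM decoder, makes the conditioning information harder for the decoder to ignore.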