International Joint Conference on Natural Language Processing

Implicit Deep Latent Variable Models for Text Generation

Abstract

Deep latent variable models (LVM) such as variational auto-encoder (VAE) have recently played an important role in text generation. One key factor is the exploitation of smooth latent structures to guide the generation. However, the representation power of VAEs is limited for two reasons: (1) the Gaussian assumption is often made on the variational posteriors; and meanwhile (2) a notorious "posterior collapse" issue occurs. In this paper, we advocate sample-based representations of variational distributions for natural language, leading to implicit latent features, which can provide flexible representation power compared with Gaussian-based posteriors. We further develop an LVM to directly match the aggregated posterior to the prior. It can be viewed as a natural extension of VAEs with a regularization of maximizing mutual information, mitigating the "posterior collapse" issue. We demonstrate the effectiveness and versatility of our models in various text generation scenarios, including language modeling, unaligned style transfer, and dialog response generation. The source code to reproduce our experimental results is available on GitHub.
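
To make the abstract's two ideas more concrete (a sample-based, implicit posterior obtained by pushing noise through the encoder, and a regularizer that pulls the aggregated posterior toward the prior), the following is a minimal PyTorch sketch. It is our own illustration, not the authors' released code: names such as ImplicitEncoder and RatioCritic and all layer sizes are hypothetical, and the divergence is estimated here with a simple density-ratio classifier, which is one common sample-based choice.

```python
# Minimal sketch (illustrative only, not the paper's implementation) of:
# (1) an implicit posterior: the encoder maps (features, noise) directly to a
#     latent sample instead of predicting Gaussian mean/variance, and
# (2) a sample-based divergence between aggregated posterior samples and prior
#     samples, estimated with a small density-ratio classifier.
import torch
import torch.nn as nn
import torch.nn.functional as F

feat_dim, noise_dim, latent_dim = 128, 32, 32   # hypothetical sizes

class ImplicitEncoder(nn.Module):
    """Maps (sentence features, injected noise) to a latent sample z ~ q(z|x)."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(feat_dim + noise_dim, 256), nn.ReLU(),
            nn.Linear(256, latent_dim),
        )
    def forward(self, h):
        eps = torch.randn(h.size(0), noise_dim)        # source of stochasticity
        return self.net(torch.cat([h, eps], dim=-1))   # implicit posterior sample

class RatioCritic(nn.Module):
    """Classifies posterior vs. prior samples; its logit approximates
    log q(z)/p(z), giving a sample-based divergence estimate."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(latent_dim, 256), nn.ReLU(), nn.Linear(256, 1),
        )
    def forward(self, z):
        return self.net(z).squeeze(-1)

encoder, critic = ImplicitEncoder(), RatioCritic()
h = torch.randn(16, feat_dim)            # stand-in for encoded sentences
z_q = encoder(h)                         # samples from the implicit posterior
z_p = torch.randn(16, latent_dim)        # samples from the prior N(0, I)

# Critic objective: tell posterior samples (label 1) from prior samples (label 0).
critic_loss = F.binary_cross_entropy_with_logits(
    critic(z_q.detach()), torch.ones(16)
) + F.binary_cross_entropy_with_logits(critic(z_p), torch.zeros(16))

# Encoder-side regularizer: the critic's logit on posterior samples approximates
# log q(z)/p(z); minimizing its mean pulls the aggregated posterior toward the prior.
kl_estimate = critic(z_q).mean()
print(critic_loss.item(), kl_estimate.item())
```

In a full training loop this regularizer would be added to the reconstruction loss and the critic updated alternately with the encoder/decoder; those details are omitted here.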
