
Implicit Deep Latent Variable Models for Text Generation



Abstract

Deep latent variable models (LVMs) such as the variational auto-encoder (VAE) have recently played an important role in text generation. One key factor is the exploitation of smooth latent structures to guide the generation. However, the representation power of VAEs is limited for two reasons: (1) the Gaussian assumption is often made on the variational posteriors, and meanwhile (2) a notorious "posterior collapse" issue occurs. In this paper, we advocate sample-based representations of variational distributions for natural language, leading to implicit latent features, which can provide flexible representation power compared with Gaussian-based posteriors. We further develop an LVM to directly match the aggregated posterior to the prior. It can be viewed as a natural extension of VAEs with a regularization that maximizes mutual information, mitigating the "posterior collapse" issue. We demonstrate the effectiveness and versatility of our models in various text generation scenarios, including language modeling, unaligned style transfer, and dialog response generation. The source code to reproduce our experimental results is available on GitHub.
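To make the two ideas in the abstract concrete, below is a minimal PyTorch sketch (not the authors' released code): (1) an implicit, sample-based posterior that maps an input plus injected noise to a latent sample rather than predicting Gaussian parameters, and (2) a sample-based penalty that pulls the aggregated posterior toward the prior. The MMD divergence used here is one possible choice for such matching and is only an illustration; all module names and dimensions are assumptions.

```python
# Illustrative sketch only: implicit (sample-based) posterior + aggregated
# posterior-to-prior matching. Not the paper's implementation.
import torch
import torch.nn as nn


class ImplicitEncoder(nn.Module):
    """Maps an input representation x and Gaussian noise eps to a latent sample z.

    Because z = f(x, eps) with a learned nonlinear f, the induced posterior q(z|x)
    has no closed-form density (it is "implicit"), unlike the diagonal Gaussian
    posterior of a standard VAE.
    """

    def __init__(self, x_dim=256, noise_dim=32, z_dim=32, hidden=256):
        super().__init__()
        self.noise_dim = noise_dim
        self.net = nn.Sequential(
            nn.Linear(x_dim + noise_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, z_dim),
        )

    def forward(self, x):
        eps = torch.randn(x.size(0), self.noise_dim, device=x.device)
        return self.net(torch.cat([x, eps], dim=-1))


def rbf_mmd(z_q, z_p, sigma=1.0):
    """Squared MMD with an RBF kernel between posterior samples z_q and prior samples z_p."""
    def kernel(a, b):
        d2 = torch.cdist(a, b).pow(2)
        return torch.exp(-d2 / (2 * sigma ** 2))
    return kernel(z_q, z_q).mean() + kernel(z_p, z_p).mean() - 2 * kernel(z_q, z_p).mean()


if __name__ == "__main__":
    enc = ImplicitEncoder()
    x = torch.randn(64, 256)        # stand-in for sentence encodings from a text encoder
    z_q = enc(x)                    # samples from the implicit posterior
    z_p = torch.randn_like(z_q)     # samples from the N(0, I) prior
    # Sample-based regularizer matching the aggregated posterior to the prior;
    # in a full model it would be added to the decoder's reconstruction loss.
    print(rbf_mmd(z_q, z_p).item())
```

Because the posterior is defined only through samples, any regularizer used with it must likewise be estimable from samples, which is why a kernel-based or adversarial divergence is used here instead of the closed-form KL term of a standard VAE.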

