Reinforcing Coherence for Sequence to Sequence Model in Dialogue Generation

Abstract

The sequence-to-sequence (Seq2Seq) approach has gained great attention in the field of single-turn dialogue generation. However, one serious problem is that most existing Seq2Seq-based models tend to generate common responses that lack specific meaning. Our analysis shows that the underlying reason is that Seq2Seq is equivalent to optimizing Kullback-Leibler (KL) divergence, and thus does not penalize the case in which the generated probability is high while the true probability is low. However, the true probability is unknown, which makes this problem challenging to tackle. Inspired by the fact that the coherence (i.e., similarity) between post and response is consistent with human evaluation, we hypothesize that the true probability of a response is proportional to its degree of coherence. The coherence scores are then used as the reward function in a reinforcement learning framework to penalize the case in which the generated probability is high while the true probability is low. Three different types of coherence models are proposed in this paper: an unlearned similarity function, a pretrained semantic matching function, and an end-to-end dual learning architecture. Experimental results on both the Chinese Weibo dataset and the English Subtitle dataset show that the proposed models produce more specific and meaningful responses, yielding better performance than Seq2Seq models in terms of both metric-based and human evaluations.
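
The two ideas in the abstract can be illustrated with a minimal Python sketch. This is not the paper's implementation. The abstract does not say which unlearned similarity function or which reinforcement learning algorithm is used, so the cosine similarity over averaged word vectors and the REINFORCE-style loss below are assumptions, and the names `kl_terms`, `cosine_coherence`, and `reinforce_loss` are hypothetical.

```python
import numpy as np


def kl_terms(p, q):
    """Per-response terms p * log(p / q) of KL(p || q).

    Assumes every entry of p and q is strictly positive.
    """
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    return p * np.log(p / q)


def cosine_coherence(post_vecs, resp_vecs):
    """Assumed 'unlearned' coherence score: cosine similarity between
    the mean word vectors of the post and of the response."""
    p = np.asarray(post_vecs, dtype=float).mean(axis=0)
    r = np.asarray(resp_vecs, dtype=float).mean(axis=0)
    denom = np.linalg.norm(p) * np.linalg.norm(r)
    return float(p @ r / denom) if denom > 0 else 0.0


def reinforce_loss(token_log_probs, reward, baseline=0.0):
    """REINFORCE-style surrogate loss for one sampled response:
    -(reward - baseline) * sum of the token log-probabilities."""
    return -(reward - baseline) * float(np.sum(token_log_probs))


# Toy numbers (made up for illustration): the first response is generic,
# so its true probability is low although the model rates it highly.
true_p = [0.01, 0.99]
model_q = [0.60, 0.40]
generic_term = kl_terms(true_p, model_q)[0]   # weighted by the small true_p
reverse_term = kl_terms(model_q, true_p)[0]   # weighted by the large model_q
```

In this toy case the generic response's term in KL(p || q) stays close to zero because it is weighted by the small true probability, whereas the corresponding term with the roles swapped is large. A coherence reward such as `cosine_coherence` stands in for the unknown true probability, and passing it to `reinforce_loss` lowers the likelihood of sampled responses whose reward falls below the baseline.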

Bibliographic record

Source
International Joint Conference on Artificial Intelligence | 2018 | pp. 4403-5141 | 7 pages
Authors
Hainan Zhang; Yanyan Lan; Jiafeng Guo; Jun Xu; Xueqi Cheng
Original format: PDF
Chinese Library Classification: TP18-53

