首页> 外文会议>International joint conference on natural language processing >A Semi-Supervised Stable Variational Network for Promoting Replier-Consistency in Dialogue Generation
【24h】

A Semi-Supervised Stable Variational Network for Promoting Replier-Consistency in Dialogue Generation

机译:一个半监督稳定的变分网络,用于在对话生成中推广Replier-Consive

获取原文

摘要

Neural sequence-to-sequence models for dialog systems suffer from the problem of favoring uninformative and non replier-specific responses due to lacking of the global and relevant information guidance. The existing methods model the generation process by leveraging the neural variational network with simple Gaussian. However, the sampled information from latent space usually becomes useless due to the KL divergence vanishing issue, and the highly abstractive global variables easily dilute the personal features of replier, leading to a non replier-specific response. Therefore, a novel Semi-Supervised Stable Variational Network (SSVN) is proposed to address these issues. We use a unit hypersperical distribution, namely the von Mises-Fisher (vMF), as the latent space of a semi-supervised model, which can obtain the stable KL performance by setting a fixed variance and hence enhance the global information representation. Meanwhile, an unsupervised extractor is introduced to automatically distill the replier-tailored feature which is then injected into a supervised generator to encourage the replier-consistency. Experimental results on two large conversation datasets show that our model outperforms the competitive baseline models significantly, and can generate diverse and replier-specific responses.
机译:由于缺乏全球和相关信息指导,对话系统的神经序列对对话系统的序列模型遭受了有利于未经信息和非复制特定响应的问题。现有方法通过利用具有简单高斯的神经变差网络来模拟生成过程。但是,由于KL发散问题,来自潜在空间的采样信息通常变得无用,并且高度抽象的全局变量很容易稀释Replier的个人功能,导致非Repropier特定的响应。因此,提出了一种新颖的半监督稳定变分网络(SSVN)来解决这些问题。我们使用单位的超级分布,即Von Mises-Fisher(VMF),作为半监督模型的潜在空间,可以通过设置固定方差来获得稳定的KL性能,从而增强全局信息表示。同时,引入了无监督的提取器以自动提取蒸馏器,然后将其注入监控发电机以鼓励重新换成。两个大型对话数据集的实验结果表明,我们的模型显着优于竞争性基线模型,可以产生多样化和特定的响应。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号