首页> 外文期刊>Computer speech and language >Generating diverse conversation responses by creating and ranking multiple candidates
【24h】

Generating diverse conversation responses by creating and ranking multiple candidates

机译:通过创建和排列多个候选者来生成各种对话响应

获取原文
获取原文并翻译 | 示例
           

摘要

This paper introduces our systems built for Track 2 of Dialog System Technology Challenge 7 (DSTC7). This challenge track aimed to evaluate the response generation methods using fully data-driven conversation models in a knowledge-grounded setting, where textual facts were provided as the knowledge for each context-response pair. The sequence-to-sequence models have achieved impressive results in machine translation and have also been widely used for end-to-end generative conversation modelling. However, they tended to output dull and repeated responses in previous studies. Our work aims to promote the diversity of end-to-end conversation response generation by adopting a two-stage pipeline. 1) Create multiple responses for an input context together with its textual facts. At this stage, two different models are designed, i.e., a variational generative (VariCen) model and a retrieval-based (Retrieval) model. 2) Rank and return the most relevant response by training a topic coherence discrimination (TCD) model for calculating ranking scores. In our experiments, we demonstrated the effectiveness of the response ranking strategy and the external textual knowledge for generating better responses. According to the official evaluation results, our Retrieval and VariCen systems ranked first and second respectively among all participant systems on Entropy metrics which measured the objective diversity of generated responses. Besides, the VariCen system ranked second on NIST and METEOR metrics which measured the objective quality of generated responses.
机译:本文介绍了我们为对话系统技术挑战7(DSTC7)的第2轨构建的系统。该挑战赛旨在以知识为基础的环境中,使用完全数据驱动的对话模型来评估响应生成方法,其中,将文本事实作为每个上下文响应对的知识提供。序列到序列模型在机器翻译中取得了令人印象深刻的结果,并且还广泛用于端到端生成对话建模。但是,在以前的研究中,他们倾向于产生沉闷和反复的反应。我们的工作旨在通过采用两阶段管道来促进端到端对话响应生成的多样性。 1)为输入上下文及其文本事实创建多个响应。在此阶段,设计了两个不同的模型,即变体生成(VariCen)模型和基于检索的(Retrieval)模型。 2)通过训练主题一致性鉴别(TCD)模型来计算排名分数,从而对最相关的响应进行排名并返回最相关的响应。在我们的实验中,我们证明了反应排名策略和外部文本知识对产生更好反应的有效性。根据官方评估结果,我们的Retrieval和VariCen系统在所有参与者系统的熵指标上分别排名第一和第二,该指标衡量了所生成响应的客观多样性。此外,VariCen系统在衡量生成的响应的客观质量的NIST和METEOR指标上排名第二。

著录项

  • 来源
    《Computer speech and language》 |2020年第7期|101071.1-101071.11|共11页
  • 作者单位

    National Engineering Laboratory for Speech and Language Information Processing University of Science and Technology of China Hefei PR China;

    Department of Electrical and Computer Engineering Queen's University Kingston Canada;

    National Engineering Laboratory for Speech and Language Information Processing University of Science and Technology of China Hefei PR China iFLYTEK Research Hefei PR China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    End-to-end; Conversation response generation; Variational autoencoders; Diversity;

    机译:端到端会话响应生成;可变自动编码器;多元化;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号