首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Vocabulary Pyramid Network: Multi-Pass Encoding and Decoding with Multi-Level Vocabularies for Response Generation
【24h】

Vocabulary Pyramid Network: Multi-Pass Encoding and Decoding with Multi-Level Vocabularies for Response Generation

机译:词汇金字塔网络:多级词汇与多级词汇进行多级编码和解码

获取原文

摘要

We study the task of response generation. Conventional methods employ a fixed vocabulary and one-pass decoding, which not only make them prone to safe and general responses but also lack further refining to the first generated raw sequence. To tackle the above two problems, we present a Vocabulary Pyramid Network (VPN) which is able to incorporate multi-pass encoding and decoding with multi-level vocabularies into response generation. Specifically, the dialogue input and output are represented by multi-level vocabularies which are obtained from hierarchical clustering of raw words. Then, multi-pass encoding and decoding are conducted on the multilevel vocabularies. Since VPN is able to leverage rich encoding and decoding information with multi-level vocabularies, it has the potential to generate better responses. Experiments on English Twitter and Chinese Wei-bo datasets demonstrate that VPN remarkably outperforms strong baselines.
机译:我们研究了反应生成的任务。常规方法采用固定的词汇和一次通过解码,这不仅使其容易易于安全和一般的反应,而且还缺乏进一步改进第一生成的原始序列。为了解决上述两个问题,我们介绍了一种词汇金字塔网络(VPN),它能够将多级词汇表中的多级编码和解码结合到响应生成中。具体地,对话输入和输出由多级词汇表表示,该多级词汇表是从原始词的分层聚类获得的。然后,在多级词汇表上进行多通编码和解码。由于VPN能够利用具有多级词汇表的丰富编码和解码信息,因此它具有产生更好的响应。英语推特和中国魏波数据集的实验证明了VPN非常优于强大的基线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号