首页> 外文会议>International conference on computational linguistics >Abstractive Unsuperviscd Multi-Document Summarization using Paraphrastic Sentence Fusion
【24h】

Abstractive Unsuperviscd Multi-Document Summarization using Paraphrastic Sentence Fusion

机译:使用副词句子融合的抽象无监督多文档摘要

获取原文

摘要

In this work, we aim at developing an unsupervised abstractive summarization system in the multi-document setting. We design a paraphrastic sentence fusion model which jointly performs sentence fusion and paraphrasing using skip-gram word embedding model at the sentence level. Our model improves the information coverage and at the same time abstractiveness of the generated sentences. We conduct our experiments on the human-generated multi-sentence compression datasets and evaluate our system on several newly proposed Machine Translation (MT) evaluation metrics. Furthermore, we apply our sentence level model to implement an abstractive multi-document summarization system where documents usually contain a related set of sentences. We also propose an optimal solution for the classical summary length limit problem which was not addressed in the past research. For the document level summary, we conduct experiments on the datasets of two different domains (e.g., news article and user reviews) which are well suited for multi-document abstractive summarization. Our experiments demonstrate that the methods bring significant improvements over the state-of-the-art methods.
机译:在这项工作中,我们旨在在多文档环境中开发无监督的抽象摘要系统。我们设计了一种准短语融合模型,该模型在句子级别使用skip-gram词嵌入模型共同执行句子融合和释义。我们的模型提高了信息覆盖率,同时提高了所生成句子的抽象性。我们对人为产生的多句子压缩数据集进行了实验,并根据几种新提出的机器翻译(MT)评估指标对我们的系统进行了评估。此外,我们使用句子级别模型来实现抽象的多文档摘要系统,其中文档通常包含一组相关的句子。我们还为经典的摘要长度限制问题提出了一种最佳解决方案,该问题在过去的研究中并未得到解决。对于文档级别的摘要,我们对两个非常适合进行多文档抽象摘要的不同领域(例如新闻文章和用户评论)的数据集进行实验。我们的实验表明,与现有技术相比,这些方法带来了显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号