首页> 外文期刊>Applied Intelligence >Multi-document summarization via submodularity
【24h】

Multi-document summarization via submodularity

机译:通过子模块进行多文档汇总

获取原文
获取原文并翻译 | 示例

摘要

Multi-document summarization is becoming an important issue in the Information Retrieval community. It aims to distill the most important information from a set of documents to generate a compressed summary. Given a set of documents as input, most of existing multi-document summarization approaches utilize different sentence selection techniques to extract a set of sentences from the document set as the summary. The submodularity hidden in the term coverage and the textual-unit similarity motivates us to incorporate this property into our solution to multi-document summarization tasks. In this paper, we propose a new principled and versatile framework for different multi-document summarization tasks using submodular functions (Nemhauser et al. in Math. Prog. 14(1):265–294, 1978) based on the term coverage and the textual-unit similarity which can be efficiently optimized through the improved greedy algorithm. We show that four known summarization tasks, including generic, query-focused, update, and comparative summarization, can be modeled as different variations derived from the proposed framework. Experiments on benchmark summarization data sets (e.g., DUC04-06, TAC08, TDT2 corpora) are conducted to demonstrate the efficacy and effectiveness of our proposed framework for the general multi-document summarization tasks.
机译:多文档摘要已成为信息检索社区中的重要问题。它旨在从一组文档中提取最重要的信息,以生成压缩的摘要。给定一组文档作为输入,大多数现有的多文档摘要方法利用不同的句子选择技术从文档集中提取一组句子作为摘要。隐藏在术语“覆盖率”和文本单位相似性中的次模块性促使我们将此属性纳入我们对多文档摘要任务的解决方案中。在本文中,我们基于术语“覆盖率”和“覆盖率”,提出了一种使用子模块函数针对不同的多文档摘要任务的新的原则性和通用框架(Nemhauser等人,在Math。Prog。14(1):265-294,1978年)。可以通过改进的贪婪算法有效地优化文本单位相似度。我们表明,可以将四个已知的摘要任务(包括通用,针对查询,更新和比较摘要)建模为从所提出的框架派生的不同变体。进行了基准摘要数据集(例如DUC04-06,TAC08,TDT2语料库)的实验,以证明我们提出的框架对一般多文档摘要任务的有效性和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号