Journal: Intelligent Control and Automation

Multi-Document Summarization Model Based on Integer Linear Programming

Abstract

This paper proposes an extractive generic text summarization model that generates summaries by selecting sentences according to their scores. Sentence scores are calculated from how extensively each sentence covers the main content of the text, and summaries are created by extracting the highest-scoring sentences from the original document. The model is formalized as a multiobjective integer programming problem. An advantage of this model is that it covers the main content of the source document(s) while reducing redundancy in the generated summaries. To extract sentences that together cover the main content of the text with little redundancy, the model uses two measures: the similarity of each sentence to the original document and the similarity between sentences. Performance is evaluated by comparing the generated summaries with the manual summaries of the DUC2004 dataset. Experiments showed that the proposed approach outperforms the related methods.
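The abstract does not give the exact objective or constraints, so the following is only a minimal sketch of the general idea, under stated assumptions: sentences are compared by cosine similarity over bag-of-words counts, coverage and redundancy are combined into a single score through a hypothetical `redundancy_weight`, and a hypothetical `max_words` budget limits summary length. For simplicity the binary selection problem is solved by exhaustive enumeration, which is only practical for a handful of sentences; it stands in for an integer programming solver and is not the paper's method.

```python
import math
from collections import Counter
from itertools import product


def bag_of_words(text):
    """Lowercased whitespace tokens with surrounding punctuation stripped."""
    tokens = (tok.strip(".,;:!?()\"'") for tok in text.lower().split())
    return Counter(tok for tok in tokens if tok)


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def summarize(sentences, max_words, redundancy_weight=1.0):
    """Choose the subset maximizing coverage minus weighted redundancy.

    Assumed score (not taken from the paper):
        sum_i sim(s_i, D) * x_i
        - redundancy_weight * sum_{i<j} sim(s_i, s_j) * x_i * x_j
    subject to sum_i len(s_i) * x_i <= max_words, with x_i in {0, 1}.
    """
    vectors = [bag_of_words(s) for s in sentences]
    document = sum(vectors, Counter())  # term counts of the whole input
    lengths = [len(s.split()) for s in sentences]
    coverage = [cosine(v, document) for v in vectors]
    n = len(sentences)

    best_score, best_choice = 0.0, (0,) * n
    # Enumerate every 0/1 assignment; exponential in n, fine for a toy input.
    for choice in product((0, 1), repeat=n):
        picked = [i for i in range(n) if choice[i]]
        if sum(lengths[i] for i in picked) > max_words:
            continue  # violates the length budget
        score = sum(coverage[i] for i in picked)
        score -= redundancy_weight * sum(
            cosine(vectors[i], vectors[j])
            for pos, i in enumerate(picked)
            for j in picked[pos + 1:]
        )
        if score > best_score:
            best_score, best_choice = score, choice
    return [sentences[i] for i in range(n) if best_choice[i]]


if __name__ == "__main__":
    docs = ["a b c", "a b c", "d e f"]
    print(summarize(docs, max_words=9))
```

With the redundancy penalty active, selecting both copies of a repeated sentence costs more than it gains in coverage, so only one copy is kept. Setting `redundancy_weight=0` reduces the score to pure coverage.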
