首页> 外文期刊>Iranian Journal of Science and Technology, Transactions of Electrical Engineering >MSCSO: Extractive Multi-document Summarization Based on a New Criterion of Sentences Overlapping
【24h】

MSCSO: Extractive Multi-document Summarization Based on a New Criterion of Sentences Overlapping

机译:MSCSO:基于句子重叠的新标准的提取多文件摘要

获取原文
获取原文并翻译 | 示例
       

摘要

Extractive multi-document summarization receives a set of documents and extracts the important sentences to form a summary. This paper proposes a novel multi-document summarization with sentences overlapping. First, we preprocess multi-document and calculate 12 features of each sentence. This paper suggests four new features: ROUGE-1 and ROUGE-2 score between the sentence and a single document, ROUGE-1 and ROUGE-2 score between the sentence and multiple documents, also a new definition of sentence overlapping feature. Then, we assign each sentence a score by the learned model. We calculate pairwise overlapping between the sentences and finally select the sentences with higher score and less redundancy. These sentences are given to form the final summary to output under a length constraint. Our method is language free, and it can be implemented on other languages with minor changes. The proposed method is tested on DUC 2006 and 2007 datasets. The effectiveness of this technique is measured using the ROUGE score, and the results are promising when they have been compared with some existing methods.
机译:提取多文件摘要收到一组文档并提取重要句子以形成摘要。本文提出了一种与句子重叠的新型多文件摘要。首先,我们预处理多文档并计算每个句子的12个功能。本文提出了四个新功能:句子和单个文档之间的Rouge-1和Rouge-2分数,句子和多个文件之间的Rouge-1和Rouge-2得分,也是句子重叠功能的新定义。然后,我们通过学习模型分配每个句子。我们计算句子之间的成对重叠,最后选择具有较高分数和更少冗余的句子。给出这些句子以在长度约束下形成最终摘要。我们的方法是免费的语言,它可以在其他语言上实现,其中更改。在DUC 2006和2007数据集上测试了该方法。使用Rouge分数测量该技术的有效性,并且在与一些现有方法进行比较时,结果是有前途的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号