Venue: 9th International Conference on Language Resources and Evaluation

A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization


Abstract

In the period since 2004, many novel sophisticated approaches for generic multi-document summarization have been developed. Intuitive simple approaches have also been shown to perform unexpectedly well for the task. Yet it is practically impossible to compare the existing approaches directly, because systems have been evaluated on different datasets, with different evaluation measures, against different sets of comparison systems. Here we present a corpus of summaries produced by several state-of-the-art extractive summarization systems or by popular baseline systems. The inputs come from the 2004 DUC evaluation, the latest year in which generic summarization was addressed in a shared task. We use the same settings for ROUGE automatic evaluation to compare the systems directly and analyze the statistical significance of the differences in performance. We show that in terms of average scores the state-of-the-art systems appear similar but that in fact they produce very different summaries. Our corpus will facilitate future research on generic summarization and motivates the need for development of more sensitive evaluation measures and for approaches to system combination in summarization.
