首页> 外文会议>Workshop on evaluation metrics and system comparison for automatic summarization 2012 >Discrepancy Between Automatic and Manual Evaluation of Summaries
【24h】

Discrepancy Between Automatic and Manual Evaluation of Summaries

机译:自动和手动摘要评估之间的差异

获取原文
获取原文并翻译 | 示例

摘要

Today, automatic evaluation metrics such as ROUGE have become the de-facto mode of evaluating an automatic summarization system. However, based on the DUC and the TAC evaluation results, (Conroy and Schlesinger, 2008; Dang and Owczarzak, 2008) showed that the performance gap between human-generated summaries and system-generated summaries is clearly visible in manual evaluations but is often not reflected in automated evaluations using ROUGE scores. In this paper, we present our own experiments in comparing the results of manual evaluations versus automatic evaluations using our own text summarizer: BlogSum. We have evaluated BlogSum-generated summary content using ROUGE and compared the results with the original candidate list (OList). The t-test results showed that there is no significant difference between BlogSum-generated summaries and OList summaries. However, two manual evaluations for content using two different datasets show that BlogSum performed significantly better than OList. A manual evaluation of summary coherence also shows that BlogSum performs significantly better than OList. These results agree with previous work and show the need for a better automated summary evaluation metric rather than the standard ROUGE metric.
机译:如今,诸如ROUGE之类的自动评估指标已成为评估自动摘要系统的实际模式。但是,根据DUC和TAC评估结果,(Conroy和Schlesinger,2008; Dang和Owczarzak,2008)表明,人工生成的摘要和系统生成的摘要之间的性能差距在手动评估中清晰可见,但通常不会反映在使用ROUGE分数的自动评估中。在本文中,我们展示了我们自己的实验,使用我们自己的文本摘要程序BlogSum比较手动评估和自动评估的结果。我们使用ROUGE评估了BlogSum生成的摘要内容,并将结果与​​原始候选列表(OList)进行了比较。 t检验结果表明BlogSum生成的摘要和OList摘要之间没有显着差异。但是,使用两个不同的数据集对内容进行的两次手动评估表明,BlogSum的性能明显优于OList。手动评估摘要一致性还显示BlogSum的性能明显优于OList。这些结果与以前的工作一致,表明需要更好的自动化摘要评估度量标准,而不是标准的ROUGE度量标准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号