首页> 外文会议>Workshop on evaluation metrics and system comparison for automatic summarization 2012 >The Heterogeneity Principle in Evaluation Measures for Automatic Summarization
【24h】

The Heterogeneity Principle in Evaluation Measures for Automatic Summarization

机译:自动汇总评估方法中的异质性原则

获取原文
获取原文并翻译 | 示例

摘要

The development of summarization systems requires reliable similarity (evaluation) measures that compare system outputs with human references. A reliable measure should have correspondence with human judgements. However, the reliability of measures depends on the test collection in which the measure is meta-evaluated; for this reason, it has not yet been possible to reliably establish which are the best evaluation measures for automatic summarization. In this paper, we propose an unsupervised method called Heterogeneity-Based Ranking (HBR) that combines summarization evaluation measures without requiring human assessments. Our empirical results indicate that HBR achieves a similar correspondence with human assessments than the best single measure for every observed corpus. In addition, HBR results are more robust across topics than single measures.
机译:摘要系统的开发需要可靠的相似性(评估)措施,以比较系统输出和人工参考。可靠的措施应与人类的判断相一致。但是,度量的可靠性取决于对度量进行元评估的测试集合。因此,尚无法可靠地确定哪些是自动汇总的最佳评估方法。在本文中,我们提出了一种称为“基于异质性排名”(HBR)的无监督方法,该方法结合了汇总评估方法而无需人工评估。我们的经验结果表明,与对每个观察到的语料库的最佳单一度量相比,HBR在人类评估中获得了相似的对应。此外,跨主题的HBR结果比单一指标更可靠。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号