首页> 外文会议>Conference on empirical methods in natural language processing >Better Summarization Evaluation with Word Embeddings for ROUGE
【24h】

Better Summarization Evaluation with Word Embeddings for ROUGE

机译:使用词嵌入对ROUGE进行更好的汇总评估

获取原文

摘要

ROUGE is a widely adopted, automatic evaluation measure for text summarization. While it has been shown to correlate well with human judgements, it is biased towards surface lexical similarities. This makes it unsuitable for the evaluation of abstractive summarization, or summaries with substantial paraphrasing. We study the effectiveness of word embeddings to overcome this disadvantage of ROUGE. Specifically, instead of measuring lexical overlaps, word embeddings are used to compute the semantic similarity of the words used in summaries instead. Our experimental results show that our proposal is able to achieve better correlations with human judgements when measured with the Spearman and Kendall rank coefficients.
机译:ROUGE是一种广泛采用的自动评估文本摘要的方法。虽然已证明它与人类的判断有很好的相关性,但它偏向表面词汇相似性。这使其不适合抽象摘要或带有实质性措辞的摘要的评估。我们研究了词嵌入的有效性,以克服ROUGE的这一缺点。具体来说,不是测量词汇重叠,而是使用词嵌入来计算摘要中使用的词的语义相似性。我们的实验结果表明,当使用Spearman和Kendall秩系数进行测量时,我们的建议能够与人类的判断获得更好的相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号