A Methodology for Extrinsic Evaluation of Text Summarization:Does ROUGE Correlate?

机译：文本摘要的外部评估方法：ROUGE是否相关？

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper demonstrates the usefulness of summariesin an extrinsic task of relevance judgmentbased on a new method for measuring agreement,Relevance-Prediction, which compares subjects’judgments on summaries with their own judgmentson full text documents. We demonstrate that,because this measure is more reliable than previousgold-standard measures, we are able to makestronger statistical statements about the benefits ofsummarization. We found positive correlations betweenROUGE scores and two different summarytypes, where only weak or negative correlationswere found using other agreement measures. However,we show that ROUGE may be sensitive to thechoice of summarization style. We discuss the importanceof these results and the implications for futuresummarization evaluations.

机译：本文展示了总结的有用性在相关性判断的外部任务中基于一种新的衡量协议的方法，关联性预测，用于比较主题的对总结的判断与自己的判断在全文文件上。我们证明，因为这种措施比以前更可靠黄金标准的措施，我们有能力做出有关以下方面好处的更强有力的统计声明总结。我们发现 ROUGE分数和两个不同的摘要类型，其中只有弱或负相关是通过其他协议措施发现的。然而，我们表明ROUGE可能对摘要样式的选择。我们讨论重要性这些结果及其对未来的影响总结评估。

著录项

来源
《43rd Annual Meeting of the Association for Computational Linguistics: Proceeding of the Conference》|2005年|1-8|共8页
会议地点
作者
Bonnie J. Dorr; Christof Monz; Stacy President; Richard Schwartz; David Zajic;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Text Summarization Challenge 2 Text summarization evaluation at NTCIR Workshop 3 [J] . Manabu Okumura, Takahiro Fukusima, Hidetsugu Nanba, ACM SIGIR FORUM . 2004,第1期

机译：文字摘要挑战2 NTCIR研讨会3的文字摘要评估
2. Text Summarization and Discovery of Frames and Relationship from Natural Language Text - A R&D Methodology [J] . P.Chakrabarti, J.K. Basu International Journal on Computer Science and Engineering . 2010,第3期

机译：文本摘要以及从自然语言文本中发现框架和关系的研究方法
3. Deep Learning Based Abstractive Text Summarization: Approaches, Datasets, Evaluation Measures, and Challenges [J] . Dima Suleiman, Arafat Awajan Mathematical Problems in Engineering: Theory, Methods and Applications . 2020,第1期

机译：基于深度学习的抽象文本摘要：方法，数据集，评估措施和挑战
4. Re-evaluating Automatic Summarization with bleu and 192 Shades of rouge [C] . Yvette Graham Conference on empirical methods in natural language processing . 2015

机译：使用bleu和192个胭脂阴影重新评估自动汇总
5. Statistical analysis of text summarization evaluation [D] . Rankel, Peter A. 2016

机译：文本摘要评估的统计分析
6. Usability evaluation of an experimental text summarization system and three search engines: implications for the reengineering of health care interfaces. [O] . Andre W. Kushniruk, Min-Yem Kan, Kathleen McKeown, 2002

机译：实验性文本摘要系统和三个搜索引擎的可用性评估：对医疗保健界面的重新设计的意义。
7. TEXT SUMMARIZATION EVALUATION: CORRELATING HUMAN PERFORMANCE ON AN EXTRINSIC TASK WITH AUTOMATIC INTRINSIC METRICS [O] . Stacy F. President, Bonnie J. Dorr 2006

机译：文本摘要评估：将人的绩效与自动内在指标相关联
8. Text Summarization Evaluation: Correlating Human Performance on an Extrinsic Task with Automatic Intrinsic Metrics [R] . President, S. F. , Dorr, B. J. 2006

机译：文本摘要评估：将外部任务的人员绩效与自动内在度量相关联

A Methodology for Extrinsic Evaluation of Text Summarization:Does ROUGE Correlate?

摘要

著录项

相似文献

相关主题

期刊订阅