Summarization Evaluation meets Short-Answer Grading

Abstract

Summarization Evaluation and Short-Answer Grading (SAG) share the challenge of automatically evaluating content quality. Therefore, we explore the use of ROUGE, a well-known Summarization Evaluation method, for Short-Answer Grading. We find a reliable ROUGE parametrization that is robust across corpora and languages and produces scores that are significantly correlated with human short-answer grades. In a by-corpus evaluation, ROUGE adds no information to NLP-based machine learning features for Short-Answer Grading. However, on a question-by-question basis, we find that the ROUGE Recall score may outperform standard NLP features. We therefore suggest using ROUGE within a framework for per-question feature selection or as a reliable and reproducible baseline for SAG.
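As a rough illustration of the metric the abstract builds on, the sketch below computes ROUGE-N recall between a reference (teacher) answer and a student answer. The paper's actual parametrization (n-gram order, stemming, stopword handling) is not reproduced here; the unigram setting, the whitespace tokenization, and the example answers are assumptions for illustration only.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of n-grams from a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(reference, candidate, n=1):
    """ROUGE-N recall: the fraction of the reference's n-grams that also
    appear in the candidate, with overlap counts clipped to the candidate."""
    ref_ngrams = ngrams(reference.lower().split(), n)
    cand_ngrams = ngrams(candidate.lower().split(), n)
    if not ref_ngrams:
        return 0.0
    overlap = sum(min(count, cand_ngrams[gram]) for gram, count in ref_ngrams.items())
    return overlap / sum(ref_ngrams.values())


# Hypothetical short-answer grading usage: score a student answer
# against the teacher's reference answer.
reference_answer = "The function returns the largest element of the list."
student_answer = "It returns the biggest element in the list."
print(rouge_n_recall(reference_answer, student_answer, n=1))
```

In a SAG setting, a recall-oriented score of this kind rewards a student answer for covering the content of the reference answer, which is why the abstract highlights ROUGE Recall rather than precision.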
