A Crowdsourcing Approach to Evaluate the Quality of Query-based Extractive Text Summaries

International Conference on Quality of Multimedia Experience


Abstract

High cost and time consumption are persistent barriers to research on, and application of, automated summarization. To explore options for overcoming these barriers, we analyze the feasibility and appropriateness of micro-task crowdsourcing for evaluating different summary quality characteristics, and we report ongoing work on the crowdsourced evaluation of query-based extractive text summaries. To this end, we assess a number of linguistic quality factors: grammaticality, non-redundancy, referential clarity, focus, and structure & coherence. Our first results indicate that referential clarity, focus, and structure & coherence are the main factors affecting the summary quality perceived by crowdworkers. Further, we compare these results against an initial set of expert annotations that is currently being collected, as well as against the automatic quality score ROUGE, which is widely used for summary evaluation. Preliminary results show that ROUGE does not correlate with the linguistic quality factors, regardless of whether they are assessed by the crowd or by experts. Moreover, crowd and expert ratings correlate most strongly when assessing low-quality summaries; the assessments increasingly diverge for summaries judged to be of high quality.
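To make the reported comparison concrete, the following is a minimal sketch, not the authors' code, of how ROUGE scores can be correlated with human ratings of a linguistic quality factor. It assumes Python with the rouge-score and SciPy packages; the reference, summaries, crowd ratings, and the choice of ROUGE-1 F1 are hypothetical illustrations, since the abstract does not specify the ROUGE variant or the data.

```python
# Sketch of the correlation analysis described in the abstract: compare
# automatic ROUGE scores against mean crowd ratings of a quality factor.
from rouge_score import rouge_scorer  # pip install rouge-score
from scipy.stats import spearmanr     # pip install scipy

reference = "After a brief debate, the committee approved the city budget."
summaries = [
    "The committee approved the city budget after a brief debate.",
    # Same words, scrambled: unigram overlap (ROUGE-1) is unchanged,
    # but grammaticality -- a linguistic quality factor -- collapses.
    "Budget the brief after committee city approved debate a the.",
    "The committee talked about several topics at its meeting.",
    "A debate happened.",
]
# Hypothetical mean crowd ratings (1-5 scale) for one factor, e.g. focus.
crowd_ratings = [4.7, 1.6, 2.9, 2.1]

scorer = rouge_scorer.RougeScorer(["rouge1"], use_stemmer=True)
rouge_f1 = [scorer.score(reference, s)["rouge1"].fmeasure for s in summaries]

# Rank correlation between ROUGE and human judgments; the paper's finding
# is that such correlations are low for linguistic quality factors.
rho, p = spearmanr(rouge_f1, crowd_ratings)
print(f"ROUGE-1 F1 per summary: {[round(r, 2) for r in rouge_f1]}")
print(f"Spearman rho = {rho:.2f} (p = {p:.3f})")
```

The scrambled second summary is the instructive case: its ROUGE-1 score equals that of the fluent first summary while its human rating is far lower, which is the kind of mismatch that yields the low ROUGE-to-human correlation the paper reports.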
