Tackling the Story Ending Biases in The Story Cloze Test

Abstract

The Story Cloze Test (SCT) is a recent framework for evaluating story comprehension and script learning. A variety of models have tackled the SCT so far. Although the original goal behind the SCT was to require systems to perform deep language understanding and commonsense reasoning for successful narrative understanding, some recent models perform significantly better than the initial baselines by leveraging human-authorship biases discovered in the SCT dataset. To shed some light on this issue, we have performed various data analyses and analyzed a variety of top-performing models presented for this task. Given the statistics we have aggregated, we have designed a new crowd-sourcing scheme that creates a new SCT dataset, which overcomes some of the biases. We benchmark a few models on the new dataset and show that the top-performing model on the original SCT dataset fails to keep up its performance. Our findings further underscore the importance of benchmarking NLP systems on various evolving test sets.

Bibliographic details

Source: Annual Meeting of the Association for Computational Linguistics, 2018, xlvii, 795 p. (6 pages)
Authors: Rishi Sharma; James F. Allen; Omid Bakhshandeh; Nasrin Mostafazadeh
Original format: PDF
Chinese Library Classification: Programming; software engineering
Indexed: 2022-08-20 20:16:16

