首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Tackling the Story Ending Biases in The Story Cloze Test
【24h】

Tackling the Story Ending Biases in The Story Cloze Test

机译:解决故事的故事在故事中的故事中的偏见

获取原文

摘要

The Story Cloze Test (SCT) is a recent framework for evaluating story comprehension and script learning. There have been a variety of models tackling the SCT so far. Although the original goal behind the SCT was to require systems to perform deep language understanding and commonsense reasoning for successful narrative understanding, some recent models could perform significantly better than the initial baselines by leveraging human-authorship biases discovered in the SCT dataset. In order to shed some light on this issue, we have performed various data analysis and analyzed a variety of top performing models presented for this task. Given the statistics we have aggregated, we have designed a new crowd-sourcing scheme that creates a new SCT dataset, which overcomes some of the biases. We benchmark a few models on the new dataset and show that the top-performing model on the original SCT dataset fails to keep up its performance. Our findings further signify the importance of benchmarking NLP systems on various evolving test sets.
机译:故事隐藏性测试(SCT)是最近评估故事理解和脚本学习的框架。到目前为止,还有各种模型解决SCT。虽然SCT背后的原始目标是要求系统对成功的叙事理解进行深入的语言理解和顽强推理,但最近的模型可以通过利用SCT数据集中发现的人权作者偏见来表现得比初始基座更好。为了在这个问题上阐明一些亮点,我们已经进行了各种数据分析,并分析了为此任务提供的各种顶级执行模型。鉴于我们汇总的统计数据,我们设计了一种新的人群采购方案,创建了一个新的SCT数据集,它克服了一些偏差。我们在新数据集上进行几个模型,并显示原始SCT数据集上的顶级模型无法跟上其性能。我们的调查结果进一步表示基准测试NLP系统对各种不断发展的测试集的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号