首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Know What You Don't Know: Unanswerable Questions for SQuAD
【24h】

Know What You Don't Know: Unanswerable Questions for SQuAD

机译:知道你不知道的是什么:小队的不答造的问题

获取原文

摘要

Extractive reading comprehension systems can often locate the correct answer to a question in a context document, but they also tend to make unreliable guesses on questions for which the correct answer is not stated in the context. Existing datasets either focus exclusively on answerable questions, or use automatically generated unanswerable questions that are easy to identify. To address these weaknesses, we present SQuADRUn, a new dataset that combines the existing Stanford Question Answering Dataset (SQuAD) with over 50.000 unanswerable questions written adversarially by crowdworkers to look similar to answerable ones. To do well on SQuADRUn, systems must not only answer questions when possible, but also determine when no answer is supported by the paragraph and abstain from answering. SQuADRUn is a challenging natural language understanding task for existing models: a strong neural system that gets 86% F1 on SQuAD achieves only 667c Fl on SQuADRUn. We release SQuADRUn to the community as the successor to SQuAD.
机译:提取阅读理解系统通常可以在上下文文档中定位对问题的正确答案,但它们也倾向于对在上下文中未说明正确答案的问题进行不可靠的猜测。现有数据集专注于可应答问题,或者使用易于识别的自动生成的未签发问题。为了解决这些缺点,我们呈现SquadRun,该数据集结合了现有的斯坦福问题接听数据集(阵容),该数据集(Squad)有超过50,000个不批准的问题,通过人群Wheckers看起来类似于可应答的问题。为了在SquadRun上做得好,系统不仅可以在可能的情况下回答问题,而且还确定段落是否支持答案并禁止回答。 SquadRun是一个具有挑战性的自然语言理解任务,适用于现有模型:一个强大的神经系统,在队中获得86%F1的F1在Squadrun上只能实现667级。我们将Squadrun释放到社区作为队的继任者。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号