首页> 外文会议>Conference on computational natural language learning >Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets
【24h】

Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets

机译:使您的数据集多样化:通过对抗性数据集中的受控方差分析泛化

获取原文

摘要

Phenomenon-specific "adversarial" datasets have been recently designed to perform targeted stress-tests for particular inference types. Recent work (Liu et al., 2019a) proposed that such datasets can be utilized for training NLI and other types of models, often allowing to learn the phenomenon in focus and improve on the challenge dataset, indicating a "blind spot" in the original training data. Yet, although a model can improve in such a training process, it might still be vulnerable to other challenge datasets targeting the same phenomenon but drawn from a different distribution, such as having a different syntactic complexity level. In this work, we extend this method to drive conclusions about a model's ability to learn and generalize a target phenomenon rather than to "learn" a dataset. by controlling additional aspects in the adversarial datasets. We demonstrate our approach on two inference phenomena - dative alternation and numerical reasoning, elaborating, and in some cases contradicting, the results of Liu et al.. Our methodology enables building better challenge datasets for creating more robust models, and may yield better model understanding and subsequent overarching improvements.
机译:现象特定的“对抗”数据集最近已被设计为针对特定推理类型执行针对性的压力测试。最近的工作(Liu等人,2019a)提出,此类数据集可用于训练NLI和其他类型的模型,通常可以让人们重点学习现象并改进挑战数据集,这表明原始数据中存在“盲点”。训练数据。但是,尽管模型可以在这样的训练过程中进行改进,但它可能仍然容易受到针对相同现象但来自不同分布(例如具有不同句法复杂性级别)的其他挑战数据集的攻击。在这项工作中,我们扩展了这种方法,以得出有关模型学习和概括目标现象而不是“学习”数据集的能力的结论。通过控制对抗性数据集中的其他方面。我们展示了我们针对两种推理现象的方法-和格交替和数值推理,阐述了Liu等人的结果,在某些情况下与之相矛盾。我们的方法可以构建更好的挑战数据集,以创建更强大的模型,并可能产生更好的模型理解以及随后的总体改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号