Posing Fair Generalization Tasks for Natural Language Inference

Abstract

Deep learning models for semantics are generally evaluated using naturalistic corpora. Adversarial methods, in which models are evaluated on new examples with known semantic properties, have begun to reveal that good performance at these naturalistic tasks can hide serious shortcomings. However, we should insist that these evaluations be fair: that the models are given data sufficient to support the requisite kinds of generalization. In this paper, we define and motivate a formal notion of fairness in this sense. We then apply these ideas to natural language inference by constructing very challenging but provably fair artificial datasets and showing that standard neural models fail to generalize in the required ways; only task-specific models that jointly compose the premise and hypothesis are able to achieve high performance, and even these models do not solve the task perfectly.
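To make the idea of a provably fair artificial NLI dataset concrete, below is a minimal sketch of one way such data can be generated: premise/hypothesis pairs drawn from a tiny quantified fragment, with gold labels computed by exhaustive model checking over a small finite domain rather than by human annotation. The toy vocabulary, the three-quantifier fragment, and the model-checking labeler are illustrative assumptions of this sketch; the paper itself builds its datasets from a richer natural-logic fragment.

```python
# Sketch: an artificial NLI dataset whose labels are provable by construction.
# Sentences are (quantifier, noun, verb) triples; the gold label follows from
# model-theoretic entailment over all interpretations in a tiny domain.
from itertools import product, combinations

DOMAIN = (0, 1)                # two entities suffice for this fragment
NOUNS = ("dog", "cat")         # hypothetical toy vocabulary
VERBS = ("runs", "sleeps")
QUANTS = ("every", "some", "no")

def powerset(xs):
    """All subsets of xs, as frozensets."""
    return [frozenset(c) for r in range(len(xs) + 1) for c in combinations(xs, r)]

def holds(sentence, model):
    """Truth of (quant, noun, verb) in a model mapping words to entity sets."""
    quant, noun, verb = sentence
    n, v = model[noun], model[verb]
    if quant == "every":
        return n <= v              # noun set is a subset of verb set
    if quant == "some":
        return bool(n & v)         # the sets overlap
    return not (n & v)             # "no": the sets are disjoint

def all_models():
    """Every assignment of entity sets to the content words."""
    words = NOUNS + VERBS
    for sets in product(powerset(DOMAIN), repeat=len(words)):
        yield dict(zip(words, sets))

def label(premise, hypothesis):
    """Gold label by exhaustive model checking: provable, not annotated."""
    models = [m for m in all_models() if holds(premise, m)]
    if all(holds(hypothesis, m) for m in models):
        return "entailment"        # hypothesis true in every premise-model
    if not any(holds(hypothesis, m) for m in models):
        return "contradiction"     # hypothesis false in every premise-model
    return "neutral"

if __name__ == "__main__":
    sentences = list(product(QUANTS, NOUNS, VERBS))
    for premise, hypothesis in product(sentences, repeat=2):
        print(" ".join(premise), "|", " ".join(hypothesis),
              "->", label(premise, hypothesis))
```

Because labels here are derived semantically, train/test splits can be arranged so that every lexical item and construction appearing at test time is attested in training, which is the kind of fairness guarantee the abstract is describing.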