Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT)

RECONSIDER: Improved Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering



Abstract

State-of-the-art Machine Reading Comprehension (MRC) models for open-domain Question Answering (QA) are typically trained for span selection using distantly supervised positive examples and heuristically retrieved negative examples. This training scheme may explain the empirical observation that these models achieve high recall among their top few predictions but low overall accuracy, motivating the need for answer re-ranking. We develop a successful re-ranking approach (RECONSIDER) for span-extraction tasks that improves upon the performance of MRC models, even beyond large-scale pre-training. RECONSIDER is trained on positive and negative examples extracted from high-confidence MRC model predictions, and uses in-passage span annotations to perform span-focused re-ranking over a smaller candidate set. As a result, RECONSIDER learns to eliminate close false positives, achieving a new extractive state of the art on four QA tasks, with 45.5% Exact Match accuracy on Natural Questions (with real user questions) and 61.7% on TriviaQA. We will release all related data, models, and code.
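The re-ranking pipeline the abstract describes can be sketched minimally as follows. This is an illustrative sketch only: the `Candidate` fields, the `[A]`/`[/A]` marker tokens, and the pluggable `score_fn` are assumptions for exposition; the paper's actual re-ranker is a cross-attention transformer scoring the question against the span-annotated passage.

```python
# Sketch of a RECONSIDER-style re-ranking pipeline: take the top-k
# predictions of a base MRC model, annotate each candidate span inside
# its passage, and re-score with a span-focused model.

from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Candidate:
    passage: str      # retrieved passage containing the candidate answer
    start: int        # character offset where the span starts
    end: int          # character offset where the span ends (exclusive)
    mrc_score: float  # confidence from the base MRC model


def mark_span(c: Candidate) -> str:
    """Insert marker tokens around the candidate span (in-passage span
    annotation), so the re-ranker can focus its attention on it."""
    return (c.passage[:c.start] + "[A] " + c.passage[c.start:c.end]
            + " [/A]" + c.passage[c.end:])


def rerank(question: str,
           candidates: List[Candidate],
           score_fn: Callable[[str, str], float]) -> Candidate:
    """Re-score each span-annotated candidate against the question and
    return the highest-scoring one."""
    return max(candidates, key=lambda c: score_fn(question, mark_span(c)))
```

In practice `score_fn` would be a trained cross-attention model over the (question, marked-passage) pair; because it only needs to score a small candidate set rather than all spans in all passages, it can afford full cross-attention between question and passage.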


