【24h】

Generation-Augmented Retrieval for Open-Domain Question Answering

机译:开放域问题应答的代代增强检索

获取原文

摘要

We propose Generation-Augmented Retrieval (GAR) for answering open-domain questions, which augments a query through text generation of heuristically discovered relevant contexts without external resources as supervision. We demonstrate that the generated contexts substantially enrich the semantics of the queries and GAR with sparse representations (BM25) achieves comparable or better performance than state-of-the-art dense retrieval methods such as DPR (Karpukhin et al., 2020). We show thai generating diverse contexts for a query is beneficial as fusing their results consistently yields better retrieval accuracy. Moreover, as sparse and dense representations are often complementary, GAR can be easily combined with DPR to achieve even better performance. GAR achieves state-of-the-art performance on Natural Questions and TriviaQA datasets under the extractive QA setup when equipped with an extractive reader, and consistently outperforms other retrieval methods when the same generative reader is used.
机译:我们提出了用于回答开放式检索(GAR)的生成,通过文本生成在没有外部资源作为监督的情况下通过文本生成增加了查询。我们证明产生的上下文基本上丰富了稀疏表示的查询和GAR的语义(BM25)的性能比最先进的密集检索方法(如DPR)实现了比较或更好的性能(Karpukhin等,2020)。我们展示了泰国为查询产生多样化的上下文是有益的,因为它们的结果一致产生更好的检索精度。此外,由于稀疏和致密的表示通常是互补的,可以轻松地与DPR轻松结合DPR以实现更好的性能。当配备有电动读卡器时,加入在Extract QA设置下实现最先进的性能,并在采用电动读卡器时,当使用相同的生成读卡器时,始终如一地占外的其他检索方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号