【24h】

PubMedQA: A Dataset for Biomedical Research Question Answering

机译:PubMedQA:生物医学研究问答的数据集

获取原文

摘要

We introduce PubMedQA. a novel biomedical question answering (QA) dataset collected from PubMed abstracts. The task of PubMedQA is to answer research questions with yeso/maybe (e.g.: Do preoperative statins reduce atrial fibrillation after coronary artery bypass grafting!) using the corresponding abstracts. PubMedQA has 1k expert-annotated, 61.2k unlabeled and 211.3k artificially generated QA instances. Each PubMedQA instance is composed of (1) a question which is either an existing research article title or derived from one. (2) a context which is the corresponding abstract without its conclusion, (3) a long answer, which is the conclusion of the abstract and. presumably, answers the research question, and (4) a yeso/maybe answer which summarizes the conclusion. PubMedQA is the first QA dataset where reasoning over biomedical research texts, especially their quantitative contents, is required to answer the questions. Our best performing model, multi-phase fine-tuning of BioBERT with long answer bag-of-word statistics as additional supervision, achieves 68.1% accuracy, compared to single human performance of 78.0% accuracy and majority-baseline of 55.2% accuracy, leaving much room for improvement.
机译:我们介绍PubMedQA。从PubMed摘要中收集的新型生物医学问答(QA)数据集。 PubMedQA的任务是使用相应的摘要以是/否/也许回答研究问题(例如:术前他汀类药物是否能减少冠状动脉搭桥术后的房颤!)。 PubMedQA具有1k专家注释,61.2k未标记和211.3k人工生成的QA实例。每个PubMedQA实例由(1)个问题组成,该问题可以是现有研究文章的标题,也可以是从中衍生出来的。 (2)上下文是没有其结论的相应摘要,(3)长答案,即摘要和的结论。大概回答了研究问题,并且(4)是/否/也许是回答,总结了结论。 PubMedQA是第一个QA数据集,需要对生物医学研究文本进行推理,尤其是其定量内容才能回答问题。我们性能最佳的模型是BioBERT的多阶段微调,具有长答案词袋统计作为额外的监控,可实现68.1%的准确性,相比之下,单个人的绩效为78.0%的准确性和多数基准为55.2%的准确性,因此有很大的改进空间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号