首页> 外文期刊>BMC Bioinformatics >BioRel: towards large-scale biomedical relation extraction
【24h】

BioRel: towards large-scale biomedical relation extraction

机译:Biorel:迈向大规模生物医学关系提取

获取原文
           

摘要

Although biomedical publications and literature are growing rapidly, there still lacks structured knowledge that can be easily processed by computer programs. In order to extract such knowledge from plain text and transform them into structural form, the relation extraction problem becomes an important issue. Datasets play a critical role in the development of relation extraction methods. However, existing relation extraction datasets in biomedical domain are mainly human-annotated, whose scales are usually limited due to their labor-intensive and time-consuming nature. We construct BioRel, a large-scale dataset for biomedical relation extraction problem, by using Unified Medical Language System as knowledge base and Medline as corpus. We first identify mentions of entities in sentences of Medline and link them to Unified Medical Language System with Metamap. Then, we assign each sentence a relation label by using distant supervision. Finally, we adapt the state-of-the-art deep learning and statistical machine learning methods as baseline models and conduct comprehensive experiments on the BioRel dataset. Based on the extensive experimental results, we have shown that BioRel is a suitable large-scale datasets for biomedical relation extraction, which provides both reasonable baseline performance and many remaining challenges for both deep learning and statistical methods.
机译:虽然生物医学出版物和文学正在迅速增长,但仍然缺乏通过计算机程序轻松处理的结构化知识。为了从纯文本中提取这些知识并将它们转换为结构形式,关系提取问题成为一个重要问题。数据集在关系开发方法的开发中发挥着关键作用。然而,生物医学领域的现有关系提取数据集主要是人类注释,其尺度通常由于其劳动密集型和耗时的性质而受到限制。我们通过使用统一的医疗语言系统作为知识库和MEDLINE作为语料库,构建BIOREL,是生物医学关系提取问题的大规模数据集。我们首先确定Medline句子的实体提出,并将其与Metamap的统一医学语言系统联系起来。然后,我们通过使用遥控监督分配每个句子标签。最后,我们将最先进的深度学习和统计机器学习方法调整为基线模型,并在Biorel数据集上进行综合实验。基于广泛的实验结果,我们已经表明,Biusel是一种适用于生物医学关系提取的大型数据集,这为深度学习和统计方法提供了合理的基线性能和许多剩余挑战。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号