首页> 美国卫生研究院文献>other >Collaborative text-annotation resource for disease-centered relation extraction from biomedical text
【2h】

Collaborative text-annotation resource for disease-centered relation extraction from biomedical text

机译:从生物医学文本中提取以疾病为中心的关系的协作文本注释资源

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Agglomerating results from studies of individual biological components has shown the potential to produce biomedical discovery and the promise of therapeutic development. Such knowledge integration could be tremendously facilitated by automated text mining for relation extraction in biomedical literature. Relation extraction systems cannot be developed without substantial datasets annotated with ground truth for benchmarking and training. The creation of such datasets is hampered by the absence of a resource for launching a distributed annotation effort, as well as by the lack of a standardized annotation schema. We have developed an annotation schema and an annotation tool which can be widely adopted so that the resulting annotated corpora from a multitude of disease studies could be assembled into a unified benchmark dataset. The contribution of this paper is threefold. First, we provide an overview of available benchmark corpora and derive a simple annotation schema for specific binary relation extraction problems such as protein-protein and gene-disease relation extraction. Second, we present BioNotate: an open source annotation resource for the distributed creation of a large corpus. Third, we present and make available the results of a pilot annotation effort of the autism disease network.
机译:对单个生物成分的研究的综合结果显示了产生生物医学发现的潜力和治疗发展的希望。通过自动文本挖掘来提取生物医学文献中的关系,可以极大地促进这种知识整合。如果没有大量标有基准测试和训练的事实的数据集,则无法开发关系提取系统。缺少用于启动分布式注释工作的资源以及缺少标准化的注释方案,阻碍了此类数据集的创建。我们已经开发了一种注释方案和注释工具,可以广泛采用该注释方法,以便将来自多种疾病研究的结果注释语料库组装到统一的基准数据集中。本文的贡献是三方面的。首先,我们提供了可用基准语料库的概述,并针对特定的二进制关系提取问题(例如蛋白质-蛋白质和基因-疾病关系提取)推导了简单的注释方案。其次,我们介绍BioNotate:一个用于大型主体的分布式创建的开源注释资源。第三,我们介绍并提供自闭症网络试点注释工作的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号