首页> 外文期刊>Journal of biomedical informatics. >Collaborative text-annotation resource for disease-centered relation extraction from biomedical text.
【24h】

Collaborative text-annotation resource for disease-centered relation extraction from biomedical text.

机译:用于从生物医学文本中提取以疾病为中心的关系的协作文本注释资源。

获取原文
获取原文并翻译 | 示例
           

摘要

Agglomerating results from studies of individual biological components has shown the potential to produce biomedical discovery and the promise of therapeutic development. Such knowledge integration could be tremendously facilitated by automated text mining for relation extraction in the biomedical literature. Relation extraction systems cannot be developed without substantial datasets annotated with ground truth for benchmarking and training. The creation of such datasets is hampered by the absence of a resource for launching a distributed annotation effort, as well as by the lack of a standardized annotation schema. We have developed an annotation schema and an annotation tool which can be widely adopted so that the resulting annotated corpora from a multitude of disease studies could be assembled into a unified benchmark dataset. The contribution of this paper is threefold. First, we provide an overview of available benchmark corpora and derive a simple annotation schema for specific binary relation extraction problems such as protein-protein and gene-disease relation extraction. Second, we present BioNotate: an open source annotation resource for the distributed creation of a large corpus. Third, we present and make available the results of a pilot annotation effort of the autism disease network.
机译:对单个生物成分的研究的综合结果显示了产生生物医学发现的潜力和治疗发展的希望。通过自动文本挖掘来提取生物医学文献中的关系,可以极大地促进这种知识整合。如果没有大量标有基准测试和训练的事实的数据集,则无法开发关系提取系统。缺少用于启动分布式注释工作的资源以及缺少标准化的注释方案,阻碍了此类数据集的创建。我们已经开发了可以广泛采用的注释模式和注释工具,以便将来自多种疾病研究的结果注释语料库组装到统一的基准数据集中。本文的贡献是三方面的。首先,我们概述了可用的基准语料库,并针对特定的二进制关系提取问题(例如蛋白质-蛋白质和基因-疾病关系提取)推导了简单的注释方案。其次,我们介绍BioNotate:一个用于大型主体的分布式创建的开源注释资源。第三,我们提出并提供自闭症网络试点注释工作的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号