首页> 美国卫生研究院文献>Database: The Journal of Biological Databases and Curation >Construction of biological networks from unstructured information based on a semi-automated curation workflow
【2h】

Construction of biological networks from unstructured information based on a semi-automated curation workflow

机译:基于半自动化策展工作流从非结构化信息构建生物网络

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Capture and representation of scientific knowledge in a structured format are essential to improve the understanding of biological mechanisms involved in complex diseases. Biological knowledge and knowledge about standardized terminologies are difficult to capture from literature in a usable form. A semi-automated knowledge extraction workflow is presented that was developed to allow users to extract causal and correlative relationships from scientific literature and to transcribe them into the computable and human readable Biological Expression Language (BEL). The workflow combines state-of-the-art linguistic tools for recognition of various entities and extraction of knowledge from literature sources. Unlike most other approaches, the workflow outputs the results to a curation interface for manual curation and converts them into BEL documents that can be compiled to form biological networks. We developed a new semi-automated knowledge extraction workflow that was designed to capture and organize scientific knowledge and reduce the required curation skills and effort for this task. The workflow was used to build a network that represents the cellular and molecular mechanisms implicated in atherosclerotic plaque destabilization in an apolipoprotein-E-deficient (ApoE −/− ) mouse model. The network was generated using knowledge extracted from the primary literature. The resultant atherosclerotic plaque destabilization network contains 304 nodes and 743 edges supported by 33 PubMed referenced articles. A comparison between the semi-automated and conventional curation processes showed similar results, but significantly reduced curation effort for the semi-automated process. Creating structured knowledge from unstructured text is an important step for the mechanistic interpretation and reusability of knowledge. Our new semi-automated knowledge extraction workflow reduced the curation skills and effort required to capture and organize scientific knowledge. The atherosclerotic plaque destabilization network that was generated is a causal network model for vascular disease demonstrating the usefulness of the workflow for knowledge extraction and construction of mechanistically meaningful biological networks.
机译:以结构化格式捕获和表示科学知识对于增进对复杂疾病所涉及的生物学机制的理解至关重要。生物学知识和有关标准化术语的知识很难以可用的形式从文献中获取。提出了一种半自动化的知识提取工作流程,该工作流程旨在允许用户从科学文献中提取因果关系和相关关系,并将其转录为可计算和人类可读的生物表达语言(BEL)。该工作流结合了最先进的语言工具,可识别各种实体并从文献资源中提取知识。与大多数其他方法不同,工作流将结果输出到用于人工策展的策展界面,并将其转换为可编译为生物网络的BEL文档。我们开发了一种新的半自动化知识提取工作流程,该工作流程旨在捕获和组织科学知识,并减少此任务所需的策划技能和工作量。该工作流程用于构建一个网络,该网络代表在载脂蛋白E缺乏(ApoE -/-)小鼠模型中与动脉粥样硬化斑块失稳有关的细胞和分子机制。该网络是使用从原始文献中提取的知识生成的。所得的动脉粥样硬化斑块失稳网络包含304个节点和743条边,并由33篇PubMed参考文章支持。半自动和常规策展过程之间的比较显示了相似的结果,但是大大减少了半自动过程的策展工作量。从非结构化文本创建结构化知识是机械化解释和知识可重用性的重要步骤。我们新的半自动化知识提取工作流程减少了收集和组织科学知识所需的策展技能和工作量。生成的动脉粥样硬化斑块去稳定网络是血管疾病的因果网络模型,证明了工作流程对于知识提取和构建具有机械意义的生物网络的有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号