首页> 外文期刊>Bioinformatics >Bio-Ontology and text: bridging the modeling gap
【24h】

Bio-Ontology and text: bridging the modeling gap

机译:生物本体和文本:弥合建模鸿沟

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Natural language processing (NLP) techniques are increasingly being used in biology to automate the capture of new biological discoveries in text, which are being reported at a rapid rate. Yet, information represented in NLP data structures is classically very different from information organized with ontologies as found in model organisms or genetic databases. To facilitate the computational reuse and integration of information buried in unstructured text with that of genetic databases, we propose and evaluate a translational schema that represents a comprehensive set of phenotypic and genetic entities, as well as their closely related biomedical entities and relations as expressed in natural language. In addition, the schema connects different scales of biological information, and provides mappings from the textual information to existing ontologies, which are essential in biology for integration, organization, dissemination and knowledge management of heterogeneous phenotypic information. A common comprehensive representation for otherwise heterogeneous phenotypic and genetic datasets, such as the one proposed, is critical for advancing systems biology because it enables acquisition and reuse of unprecedented volumes of diverse types of knowledge and information from text.
机译:动机:自然语言处理(NLP)技术正越来越多地用于生物学中,以自动捕获文本中的新生物发现,而这些新发现正在迅速报道。但是,NLP数据结构中表示的信息与模式生物或遗传数据库中的本体组织的信息在传统上有很大不同。为了促进非结构化文本中埋藏的信息与遗传数据库的信息的计算重用和集成,我们提出并评估了一个翻译方案,该方案代表了一组全面的表型和遗传实体,以及它们之间密切相关的生物医学实体和相关关系,如表述所示。自然语言。此外,该方案连接了不同规模的生物信息,并提供了从文本信息到现有本体的映射,这对于生物学中异构表型信息的集成,组织,传播和知识管理至关重要。对于异类表型和遗传数据集的通用综合表示形式(如所提出的那样),对于推进系统生物学至关重要,因为它能够从文本中获取和重用前所未有的各种类型的知识和信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号