Semi-automated ontology generation within OBO-Edit

Abstract

Motivation: Ontologies and taxonomies have proven highly beneficial for biocuration. The Open Biomedical Ontology (OBO) Foundry alone lists over 90 ontologies, mainly built with OBO-Edit. Creating and maintaining such ontologies is a labour-intensive, difficult, manual process. Automating parts of it is of great importance for the further development of ontologies and for biocuration.

Results: We have developed the Dresden Ontology Generator for Directed Acyclic Graphs (DOG4DAG), a system which supports the creation and extension of OBO ontologies by semi-automatically generating terms, definitions and parent–child relations from text in PubMed, the web and PDF repositories. DOG4DAG is seamlessly integrated into OBO-Edit. It generates terms by identifying statistically significant noun phrases in text. For definitions and parent–child relations it employs pattern-based web searches. We systematically evaluate each generation step using manually validated benchmarks. The term generation leads to high-quality terms also found in manually created ontologies. Up to 78% of definitions are valid and up to 54% of child–ancestor relations can be retrieved. There is no other validated system that achieves comparable results. By combining the prediction of high-quality terms, definitions and parent–child relations with the ontology editor OBO-Edit, we contribute a thoroughly validated tool for all OBO ontology engineers.

Availability: DOG4DAG is available within OBO-Edit 2.1.

Supplementary Information: available at Bioinformatics online.
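To make the idea of pattern-based extraction of parent–child relations concrete, the following is a minimal sketch in Python. It is not DOG4DAG's implementation: the abstract does not list the patterns the system uses, so the "X such as Y" pattern, the function name `extract_relations`, and the restriction to single-word parent terms are all assumptions made for illustration.

```python
import re

# Illustrative pattern (an assumption, not taken from the paper):
# "<parent> such as <child>, <child> and <child>".
# The parent is simplified to the single word preceding "such as".
SUCH_AS = re.compile(r"\b(\w+)\s+such as\s+([^.;]+)", re.IGNORECASE)

# Separators between enumerated children: commas (optionally followed by
# "and"/"or") or a bare "and"/"or".
SEPARATORS = re.compile(r"\s*,\s*(?:and\s+|or\s+)?|\s+(?:and|or)\s+")


def extract_relations(sentence):
    """Return (child, parent) pairs matched by the illustrative pattern."""
    pairs = []
    for match in SUCH_AS.finditer(sentence):
        parent = match.group(1).strip().lower()
        for child in SEPARATORS.split(match.group(2)):
            child = child.strip().lower()
            if child:  # skip empty fragments left by the split
                pairs.append((child, parent))
    return pairs


if __name__ == "__main__":
    text = "The cell contains organelles such as mitochondria, lysosomes and peroxisomes."
    for child, parent in extract_relations(text):
        print(f"{child} is_a {parent}")
```

In a semi-automated setting such as the one the abstract describes, candidate pairs like these would be suggestions for a curator to accept or reject rather than relations added to the ontology directly.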