Database: The Journal of Biological Databases and Curation

A semi-automated methodology for finding lipid-related GO terms

Abstract

Motivation: Although semantic similarity in Gene Ontology (GO) and other approaches may be used to find similar GO terms, there is not yet a method that systematically finds a class of GO terms sharing a common property with high accuracy (e.g. by involving human curation).

Results: We have developed a methodology to address this issue and applied it to identify lipid-related GO terms, owing to the important and varied roles of lipids in many biological processes. Our methodology finds lipid-related GO terms in a semi-automated manner, requiring only moderate manual curation. We first obtain a gold-standard list of lipid-related GO terms by keyword search and manual curation. Then, based on the hypothesis that co-annotated GO terms share similar properties, we develop a machine learning method that expands the list of lipid-related terms from the gold standard. The terms predicted as most likely to be lipid-related are examined by a human curator, who follows specific curation rules to confirm the class labels. The structure of GO is also exploited to help reduce the curation effort. The prediction and curation cycle is repeated until no further lipid-related term is found. Our approach has covered a high proportion, if not all, of lipid-related terms with relatively high efficiency.

Database URL:
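The prediction and curation cycle described in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the machine learning method, so the scoring rule here (the fraction of a term's co-annotated terms that are already confirmed as lipid-related) is a simple stand-in and not the authors' model. The gene names, the toy annotations, the `threshold` parameter, and the `curate` callback that plays the role of the human curator are all hypothetical, and the use of the GO structure to reduce curation effort is not modelled.

```python
from collections import defaultdict


def co_annotation_scores(annotations, positives, labelled):
    """Score each unlabelled term by the fraction of its co-annotated terms
    (terms sharing at least one gene with it) that are confirmed positives."""
    term_to_genes = defaultdict(set)
    for gene, terms in annotations.items():
        for term in terms:
            term_to_genes[term].add(gene)

    scores = {}
    for term, genes in term_to_genes.items():
        if term in labelled:
            continue
        neighbours = set()
        for gene in genes:
            neighbours |= annotations[gene]
        neighbours.discard(term)
        if neighbours:
            scores[term] = len(neighbours & positives) / len(neighbours)
    return scores


def expand_gold_standard(annotations, seed_positives, curate, threshold=0.5):
    """Repeat predict-then-curate until a round confirms no new term."""
    positives = set(seed_positives)
    negatives = set()
    while True:
        scores = co_annotation_scores(annotations, positives, positives | negatives)
        candidates = [t for t, s in scores.items() if s >= threshold]
        if not candidates:
            break
        # The curator confirms or rejects each predicted term.
        confirmed = {t for t in candidates if curate(t)}
        negatives |= set(candidates) - confirmed
        positives |= confirmed
        if not confirmed:
            break
    return positives


# Toy gene -> GO term annotations (hypothetical genes, illustrative only).
annotations = {
    "geneA": {"lipid metabolic process", "fatty acid binding"},
    "geneB": {"fatty acid binding", "lipid transport"},
    "geneC": {"DNA repair", "chromatin binding"},
    "geneD": {"lipid transport", "sterol transport"},
}

# Stand-in for the human curator's judgement.
truly_lipid_related = {
    "lipid metabolic process",
    "fatty acid binding",
    "lipid transport",
    "sterol transport",
}

found = expand_gold_standard(
    annotations,
    seed_positives={"lipid metabolic process"},
    curate=lambda term: term in truly_lipid_related,
)
print(sorted(found))
```

In this toy run the seed list grows by one term per round as each newly confirmed term raises the score of its co-annotated neighbours, and the loop stops once the remaining unlabelled terms score below the threshold.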
