A categorization approach to automated ontological function annotation

机译：自动本体功能注释的分类方法

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automated function prediction (AFP) methods increasingly use knowledge discovery algorithms to map sequence, structure, literature, and/or pathway information about proteins whose functions are unknown into functional ontologies, typically (a portion of) the Gene Ontology (GO). While there are a growing number of methods within this paradigm, the general problem of assessing the accuracy of such prediction algorithms has not been seriously addressed. We present first an application for function prediction from protein sequences using the POSet Ontology Categorizer (POSOC) to produce new annotations by analyzing collections of GO nodes derived from annotations of protein BLAST neighborhoods. We then also present hierarchical precision and hierarchical recall as new evaluation metrics for assessing the accuracy of any predictions in hierarchical ontologies, and discuss results on a test set of protein sequences. We show that our method provides substantially improved hierarchical precision (measure of predictions made that are correct) when applied to the nearest BLAST neighbors of target proteins, as compared with simply imputing that neighborhood's annotations to the target. Moreover, when our method is applied to a broader BLAST neighborhood, hierarchical precision is enhanced even further. In all cases, such increased hierarchical precision performance is purchased at a modest expense of hierarchical recall (measure of all annotations that get predicted at all).

机译：自动化功能预测（AFP）方法越来越多地使用知识发现算法来映射有关功能未知的蛋白质的序列，结构，文献和/或途径信息，这些信息通常是功能本体（GO）的一部分（功能本体）。尽管此范例中的方法越来越多，但尚未认真解决评估此类预测算法准确性的一般问题。我们首先介绍一个使用POSet本体分类器（POSOC）从蛋白质序列进行功能预测的应用，以通过分析从蛋白质BLAST邻域的注释派生的GO节点集合来产生新的注释。然后，我们还将提出分层精度和分层召回作为评估分层本体中任何预测的准确性的新评估指标，并讨论蛋白质序列测试集上的结果。我们表明，与简单地将邻域的注释推算至目标相比，当将其应用于目标蛋白的最近BLAST邻居时，我们的方法可显着提高分层精度（对预测的预测是正确的）。此外，当我们的方法应用于更广泛的BLAST邻域时，分层精度会进一步提高。在所有情况下，购买这种提高的层次精度性能都需要付出一定的代价，即要花费一定的层次回忆（度量所有注释的方法）。

著录项

期刊名称 Protein Science : A Publication of the Protein Society
作者
Karin Verspoor; Judith Cohn; Susan Mniszewski; Cliff Joslyn;
展开▼
作者单位

展开▼
年(卷),期 2006(15),6
年度 2006
页码 1544–1549
总页数 6
原文格式 PDF
正文语种
中图分类分子生物学;
关键词
protein function prediction Gene Ontology GO prediction evaluation metrics;

机译：蛋白质功能预测;基因本体论;GO;预测评估指标;

相似文献

外文文献
中文文献
专利

1. A categorization approach to automated ontological function annotation. [J] . Verspoor K, Cohn J, Mniszewski S, Protein Science: A Publication of the Protein Society . 2006,第6期

机译：一种用于自动本体功能注释的分类方法。
2. PFP: Automated prediction of gene ontology functional annotations with confidence scores using protein sequence data. [J] . Hawkins T, Chitale M, Luban S, Proteins: Structure, Function, and Genetics . 2009,第3期

机译：PFP：使用蛋白质序列数据以置信度分数自动预测基因本体功能注释。
3. Gene ontology annotation as text categorization: An empirical study [J] . Kazuhiro Seki, Javed Mostafa Information Processing & Management . 2008,第5期

机译：基因本体标注作为文本分类的一项实证研究
4. CLUGO: a clustering algorithm for automated functional annotations based on gene ontology [C] . In-Yee Lee, Jan-Ming Ho, Ming-Syan Chen . 2005

机译：CLUGO：基于基因本体的自动功能注释聚类算法
5. A Gaussian Mixture-Based Approach to Synthesizing Nonlinear Feature Functions for Automated Object Detection. [D] . Guo, Pei Fang. 2010

机译：一种基于高斯混合的方法，用于合成非线性特征函数以实现自动目标检测。
6. Protein annotation as term categorization in the gene ontology using word proximity networks [O] . Karin Verspoor, Judith Cohn, Cliff Joslyn, 2005

机译：使用词邻近网络将蛋白质注释作为基因本体中的术语分类
7. A categorization approach to automated ontological function annotation [O] . Verspoor, Karin, Cohn, Judith, Mniszewski, Susan, 2006

机译：自动本体功能注释的分类方法

A categorization approach to automated ontological function annotation

摘要

著录项

相似文献

相关主题

期刊订阅