Domain Content Based Protein Function Prediction Using Incomplete GO Annotation Information


Abstract

Given the essential role of proteins in life processes, computational assignment of protein function has become one of the most important tasks in bioinformatics. While the Gene Ontology (GO) is widely used in functional annotation, new approaches that leverage the GO framework to address annotation incompleteness are urgently needed. In this paper, two new models are proposed to predict GO terms from domain content: a correlation coefficient based model (CC-M) and a support vector machine (SVM) based model (SVM-M). We have developed our models in the form of predictors for all GO terms with manually curated annotations. Compared with the previously published Bayesian probabilistic approach [Forslund et al., 2008], our methods are shown to handle incomplete training data better. In particular, CC-M is suited to GO terms with extremely low occurrence frequency, and SVM-M to the remaining GO terms. CC-M and SVM-M are therefore integrated into a single model (CC-SVM) that combines their respective advantages.
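
The abstract does not say how the correlation coefficient is computed or how GO terms are routed between the two models, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes the phi coefficient (Pearson correlation on binary vectors) between domain presence and GO term annotation, scores a query protein by the best-correlated domain it contains, and picks a model per GO term with a hypothetical frequency cutoff. The identifiers `D1`, `T1`, `min_count_for_svm`, and all function names are illustrative placeholders.

```python
from math import sqrt


def phi_coefficient(x, y):
    """Pearson correlation between two equal-length 0/1 lists (phi coefficient)."""
    n = len(x)
    n11 = sum(1 for a, b in zip(x, y) if a and b)  # both present
    n1_ = sum(x)                                   # domain present
    n_1 = sum(y)                                   # term annotated
    denom = sqrt(n1_ * (n - n1_) * n_1 * (n - n_1))
    if denom == 0:
        return 0.0  # a constant vector carries no correlation signal
    return (n * n11 - n1_ * n_1) / denom


def domain_term_scores(proteins, term):
    """Correlate each domain's presence with annotation by `term`."""
    y = [1 if term in terms else 0 for _, terms in proteins]
    domains = sorted({d for doms, _ in proteins for d in doms})
    return {
        d: phi_coefficient([1 if d in doms else 0 for doms, _ in proteins], y)
        for d in domains
    }


def predict_score(scores, query_domains):
    """Assumed scoring rule: best correlation among the query's domains."""
    return max((scores.get(d, 0.0) for d in query_domains), default=0.0)


def choose_model(term_count, min_count_for_svm=10):
    """Hypothetical routing: rare terms go to CC-M, the rest to SVM-M."""
    return "CC-M" if term_count < min_count_for_svm else "SVM-M"


# Toy training set: (domain content, curated GO terms) per protein.
proteins = [
    ({"D1"}, {"T1"}),
    ({"D1", "D2"}, {"T1"}),
    ({"D2"}, set()),
    ({"D3"}, {"T2"}),
]

scores = domain_term_scores(proteins, "T1")
query_score = predict_score(scores, {"D1", "D3"})
```

In this toy data, `D1` occurs in exactly the proteins annotated with `T1`, so it receives the highest score, while `D3` never co-occurs with `T1` and scores negatively. A correlation score of this kind needs only a handful of annotated proteins, which is one plausible reason a correlation-based model would suit very rare GO terms.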


Bibliographic record

Source
IEEE International Conference on Bioinformatics and Biomedicine Workshop | 2009 | 6 pages
Authors
Lirong Tan; Zhiwen Yu; Hau-San Wong
Original format: PDF
Chinese Library Classification: G202-53
Keywords
Protein function prediction; GO term; Domain


