首页> 外文会议>Computer and Computational Sciences (IMSCCS), 2007 Second International Multisymposium on >A Weighted k-Nearest Neighbor Method for Gene Ontology BasedProtein Function Prediction
【24h】

A Weighted k-Nearest Neighbor Method for Gene Ontology BasedProtein Function Prediction

机译:基于本体的蛋白质功能预测的加权k最近邻方法

获取原文

摘要

Numerous genome projects have produced a large and ever increasing amount of genomic sequence data. However, the biological functions of many proteins encoded by the sequences remain unknown. Protein function annotation and prediction become an essential and challenging task of post-genomic research. In this paper, we present an automated protein function prediction system based on a set of proteins of known biological functions. The functions of the proteins are characterized with gene ontology (GO) annotations. The prediction system uses a novel measure to calculate the pair-wise overall similarity between protein sequences. The protein function prediction is performed based on the GO annotations of similar sequences using a weighted k-nearest neighbor method. We show the prediction accuracies obtained using the model organism yeast (Sacchyromyces cerevisiae). The results indicate that the weighted k-nearest neighbor method significantly outperforms the regular k-nearest neighbor method for protein molecular function prediction.
机译:许多基因组计划已经产生了大量且不断增加的基因组序列数据。但是,由该序列编码的许多蛋白质的生物学功能仍然未知。蛋白质功能注释和预测成为后基因组研究必不可少的挑战性任务。在本文中,我们提出了一种基于已知生物学功能的蛋白质的自动化蛋白质功能预测系统。蛋白质的功能通过基因本体论(GO)注释进行表征。该预测系统使用一种新颖的方法来计算蛋白质序列之间的成对总体相似性。基于相似序列的GO注释,使用加权k最近邻法进行蛋白质功能预测。我们显示了使用模型生物酵母(Sacchyromyces cerevisiae)获得的预测精度。结果表明,加权k-最近邻方法在蛋白质分子功能预测方面明显优于常规k-最近邻方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号