...
首页> 外文期刊>Current Bioinformatics >Global Propagation Method for Predicting Protein Function by Integrating Multiple Data Sources
【24h】

Global Propagation Method for Predicting Protein Function by Integrating Multiple Data Sources

机译:通过整合多个数据源预测蛋白质功能的全球传播方法

获取原文
获取原文并翻译 | 示例
           

摘要

Protein function prediction is one of the most important tasks in bioinformatics. Nowadays, high-throughput experiments have generated large scale genomics and proteomics data. To accurately annotate proteins, it is necessary and wise to integrate these heterogeneous data sources. In this paper, a multi-source protein global propagation (MS-PGP) algorithm has been proposed, which integrates multiple data sources and combines protein global propagation with label correlation (PGP) algorithm to predict functions for unannotated proteins. Specifically, we use three data sources to predict protein functions: sequence data, microarray gene expression data and protein-protein interaction data. A naive Bayesian fashion method is adopted to fuse the three data sources into a combined network. Gene ontology biological process annotation is used to calculate the association scores between unannotated proteins and functions. The experimental results on Yeast show that the proposed method has a higher accuracy over other multiple network methods. It is efficient to predict the function of unannotated proteins.
机译:蛋白质功能预测是生物信息学中最重要的任务之一。如今,高通量实验已经产生了大规模的基因组学和蛋白质组学数据。为了准确地注释蛋白质,整合这些异构数据源是必要且明智的。本文提出了一种多源蛋白质全局传播(MS-PGP)算法,该算法集成了多个数据源,并将蛋白质全局传播与标记相关性(PGP)算法相结合来预测未注释蛋白质的功能。具体来说,我们使用三个数据源来预测蛋白质功能:序列数据,微阵列基因表达数据和蛋白质-蛋白质相互作用数据。采用朴素的贝叶斯方式将三个数据源融合为一个组合网络。基因本体生物学过程注释用于计算未注释的蛋白质和功能之间的关联评分。在Yeast上的实验结果表明,该方法具有比其他多种网络方法更高的准确性。预测未注释蛋白的功能非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号