首页> 美国卫生研究院文献>PLoS Clinical Trials >Bayesian Markov Random Field Analysis for Protein Function PredictionBased on Network Data
【2h】

Bayesian Markov Random Field Analysis for Protein Function PredictionBased on Network Data

机译:贝叶斯马尔可夫随机场分析用于蛋白质功能预测基于网络数据

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Inference of protein functions is one of the most important aims of modern biology. To fully exploit the large volumes of genomic data typically produced in modern-day genomic experiments, automated computational methods for protein function prediction are urgently needed. Established methods use sequence or structure similarity to infer functions but those types of data do not suffice to determine the biological context in which proteins act. Current high-throughput biological experiments produce large amounts of data on the interactions between proteins. Such data can be used to infer interaction networks and to predict the biological process that the protein is involved in. Here, we develop a probabilistic approach for protein function prediction using network data, such as protein-protein interaction measurements. We take a Bayesian approach to an existing Markov Random Field method by performing simultaneous estimation of the model parameters and prediction of protein functions. We use an adaptive Markov Chain Monte Carlo algorithm that leads to more accurate parameter estimates and consequently to improved prediction performance compared to the standard Markov Random Fields method. We tested our method using a high quality S.cereviciae validation networkwith 1622 proteins against 90 Gene Ontology terms of different levels ofabstraction. Compared to three other protein function prediction methods, ourapproach shows very good prediction performance. Our method can be directlyapplied to protein-protein interaction or coexpression networks, but also can beextended to use multiple data sources. We apply our method to physical proteininteraction data from S. cerevisiae and provide novelpredictions, using 340 Gene Ontology terms, for 1170 unannotated proteins and weevaluate the predictions using the available literature.
机译:蛋白质功能的推断是现代生物学的最重要目标之一。为了充分利用现代基因组实验中通常产生的大量基因组数据,迫切需要用于蛋白质功能预测的自动计算方法。既定的方法使用序列或结构相似性来推断功能,但是这些类型的数据不足以确定蛋白质起作用的生物学环境。当前的高通量生物学实验产生了大量有关蛋白质之间相互作用的数据。此类数据可用于推断相互作用网络并预测蛋白质所涉及的生物学过程。在这里,我们开发了一种使用网络数据(例如蛋白质-蛋白质相互作用测量)进行蛋白质功能预测的概率方法。通过对模型参数同时进行估计和预测蛋白质功能,我们对现有的马尔可夫随机场方法采用了贝叶斯方法。与标准的马尔可夫随机场方法相比,我们使用了自适应马尔可夫链蒙特卡罗算法,该算法可导致更准确的参数估计,从而提高预测性能。我们使用高质量的啤酒酵母验证网络测试了我们的方法含有1622种蛋白质,分别针对90种不同程度的基因本体论术语抽象。与其他三种蛋白质功能预测方法相比,我们的该方法显示出非常好的预测性能。我们的方法可以直接应用于蛋白质-蛋白质相互作用或共表达网络,但也可以扩展为使用多个数据源。我们将我们的方法应用于物理蛋白质酿酒酵母的相互作用数据并提供新颖使用340个基因本体论术语对1170个未注释蛋白进行预测,我们使用现有文献评估预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号