...
首页> 外文期刊>Protein Science: A Publication of the Protein Society >Improving protein-protein interactions prediction accuracy using protein evolutionary information and relevance vector machine model
【24h】

Improving protein-protein interactions prediction accuracy using protein evolutionary information and relevance vector machine model

机译:使用蛋白质进化信息和相关矢量机模型改善蛋白质 - 蛋白质相互作用预测准确性

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Predicting protein-protein interactions (PPIs) is a challenging task and essential to construct the protein interaction networks, which is important for facilitating our understanding of the mechanisms of biological systems. Although a number of high-throughput technologies have been proposed to predict PPIs, there are unavoidable shortcomings, including high cost, time intensity, and inherently high false positive rates. For these reasons, many computational methods have been proposed for predicting PPIs. However, the problem is still far from being solved. In this article, we propose a novel computational method called RVM-BiGP that combines the relevance vector machine (RVM) model and Bi-gram Probabilities (BiGP) for PPIs detection from protein sequences. The major improvement includes (1) Protein sequences are represented using the Bi-gram probabilities (BiGP) feature representation on a Position Specific Scoring Matrix (PSSM), in which the protein evolutionary information is contained; (2) For reducing the influence of noise, the Principal Component Analysis (PCA) method is used to reduce the dimension of BiGP vector; (3) The powerful and robust Relevance Vector Machine (RVM) algorithm is used for classification. Five-fold cross-validation experiments executed on yeast and Helicobacter pylori datasets, which achieved very high accuracies of 94.57 and 90.57%, respectively. Experimental results are significantly better than previous methods. To further evaluate the proposed method, we compare it with the state-of-the-art support vector machine (SVM) classifier on the yeast dataset. The experimental results demonstrate that our RVM-BiGP method is significantly better than the SVM-based method. In addition, we achieved 97.15% accuracy on imbalance yeast dataset, which is higher than that of balance yeast dataset. The promising experimental results show the efficiency and robust of the proposed method, which can be an automatic decision support tool for future proteomics research. For facilitating extensive studies for future proteomics research, we developed a freely available web server called RVM-BiGP-PPIs in Hypertext Preprocessor (PHP) for predicting PPIs. The web server including source code and the datasets are available at .
机译:预测蛋白质 - 蛋白质相互作用(PPI)是一个具有挑战性的任务,构建蛋白质相互作用网络必须是必不可少的,这对于促进我们对生物系统机制的理解是重要的。尽管已经提出了许多高通量技术来预测PPI,但存在不可避免的缺点,包括高成本,时间强度和固有的高假阳性率。由于这些原因,已经提出了许多计算方法来预测PPI。但是,问题仍然远未解决。在本文中,我们提出了一种称为RVM-BIGP的新型计算方法,该方法将相关的向量机(RVM)模型和双克概率(BIGP)与蛋白质序列的PPI检测相结合。主要改进包括(1)使用位于特定得分矩阵(PSSM)上的双克概率(BIGP)特征表示来表示蛋白质序列,其中包含蛋白质进化信息; (2)为了降低噪声的影响,主要成分分析(PCA)方法用于减少BIGP载体的尺寸; (3)强大且强大的相关矢量机(RVM)算法用于分类。在酵母和幽门螺杆菌数据集上执行的五倍交叉验证实验,其分别实现了94.57和90.57%的高精度。实验结果明显优于以前的方法。为了进一步评估所提出的方法,我们将其与酵母数据集上的最先进的支持向量机(SVM)分类器进行比较。实验结果表明,我们的RVM-BIGP方法明显优于基于SVM的方法。此外,我们在不平衡酵母数据集中实现了97.15%的准确性,高于平衡酵母数据集。有希望的实验结果表明了该方法的效率和强大,可以是未来蛋白质组学研究的自动决策支持工具。为了促进对未来蛋白质组学研究的广泛研究,我们在超文本预处理器(PHP)中开发了一个可自由的Web服务器,用于预测PPI。包括源代码和数据集的Web服务器可用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号