...
首页> 外文期刊>Current Protein and Peptide Science >Prediction of Protein-Protein Interactions Based on Protein-Protein Correlation Using Least Squares Regression
【24h】

Prediction of Protein-Protein Interactions Based on Protein-Protein Correlation Using Least Squares Regression

机译:基于最小二乘回归的蛋白质-蛋白质相关性预测蛋白质-蛋白质相互作用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In order to transform protein sequences into the feature vectors, several works have been done, such as computing auto covariance (AC), conjoint triad (CT), local descriptor (LD), moran autocorrelation (MA), normalized moreau-broto autocorrelation (NMB) and so on. In this paper, we shall adopt these transformation methods to encode the proteins, respectively, where AC, CT, LD, MA and NMB are all represented by '+' in a unified manner. A new method, i.e. the combination of least squares regression with '+' (abbreviated as LSR~+), will be introduced for encoding a protein-protein correlation-based feature representation and an interacting protein pair. Thus there are totally five different combinations for LSR~+, i.e. LSRAC, LSRCT, LSRLD, LSRMA and LSRNMB. As a result, we combined a support vector machine (SVM) approach with LSR~+ to predict protein-protein interactions (PPI) and PP1 networks. The proposed method has been applied on four datasets, i.e. Saaccharomyces cerevisiae, Escherichia coli, Homo sapiens and Caenorhabditis elegans. The experimental results demonstrate that all LSR~+ methods outperform many existing representative algorithms. Therefore, LSR~+ is a powerful tool to characterize the protein-protein correlations and to infer PPI, whilst keeping high performance on prediction of PPI networks.
机译:为了将蛋白质序列转化为特征向量,已经完成了多项工作,例如计算自协方差(AC),联合三联体(CT),局部描述符(LD),莫兰自相关(MA),归一化moreau-broto自相关( NMB)等等。在本文中,我们将采用这些转化方法分别编码蛋白质,其中AC,CT,LD,MA和NMB均以统一的“ +”表示。将会引入一种新的方法,即最小二乘回归与“ +”(缩写为LSR〜+)的组合,用于编码基于蛋白质-蛋白质相关性的特征表示和相互作用的蛋白质对。因此,对于LSR_ +,共有五种不同的组合,即LSRAC,LSRCT,LSRLD,LSRMA和LSRNMB。结果,我们将支持向量机(SVM)方法与LSR〜+相结合,以预测蛋白质-蛋白质相互作用(PPI)和PP1网络。所提出的方法已经应用于四个数据集,即啤酒酵母,大肠杆菌,智人和秀丽隐杆线虫。实验结果表明,所有LSR〜+方法均优于许多现有的代表性算法。因此,LSR〜+是表征蛋白质与蛋白质相关性并推断PPI的有力工具,同时在预测PPI网络方面保持高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号