首页> 美国卫生研究院文献>International Journal of Molecular Sciences >A Machine Learning Approach for Hot-Spot Detection at Protein-Protein Interfaces
【2h】

A Machine Learning Approach for Hot-Spot Detection at Protein-Protein Interfaces

机译:一种在蛋白质-蛋白质界面上进行热点检测的机器学习方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Understanding protein-protein interactions is a key challenge in biochemistry. In this work, we describe a more accurate methodology to predict Hot-Spots (HS) in protein-protein interfaces from their native complex structure compared to previous published Machine Learning (ML) techniques. Our model is trained on a large number of complexes and on a significantly larger number of different structural- and evolutionary sequence-based features. In particular, we added interface size, type of interaction between residues at the interface of the complex, number of different types of residues at the interface and the Position-Specific Scoring Matrix (PSSM), for a total of 79 features. We used twenty-seven algorithms from a simple linear-based function to support-vector machine models with different cost functions. The best model was achieved by the use of the conditional inference random forest (c-forest) algorithm with a dataset pre-processed by the normalization of features and with up-sampling of the minor class. The method has an overall accuracy of 0.80, an F1-score of 0.73, a sensitivity of 0.76 and a specificity of 0.82 for the independent test set.
机译:了解蛋白质之间的相互作用是生物化学中的关键挑战。在这项工作中,我们描述了一种较准确的方法,与以前发布的机器学习(ML)技术相比,可以从蛋白质-蛋白质界面的天然复杂结构预测热点(HS)。我们的模型是针对大量复合物以及大量基于结构和进化序列的不同特征进行训练的。特别是,我们增加了界面大小,复合物界面上残基之间的相互作用类型,界面上不同类型残基的数量以及特定位置计分矩阵(PSSM),总共提供了79个功能。从简单的基于线性的函数到具有不同成本函数的支持向量机模型,我们使用了27种算法。最佳模型是通过使用条件推断随机森林(c-forest)算法获得的,该算法的数据集通过特征的归一化和次要类的上采样进行了预处理。该方法的总体准确度为0.80,F1评分为0.73,灵敏度为0.76,独立测试集的特异性为0.82。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号