首页> 外文期刊>Bioinformatics >Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment
【24h】

Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment

机译:使用互补结合特异性亚结构比较和序列图谱比对的蛋白质-配体结合位点识别

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: Identification of protein-ligand binding sites is critical to protein function annotation and drug discovery. However, there is no method that could generate optimal binding site prediction for different protein types. Combination of complementary predictions is probably the most reliable solution to the problem. Results: We develop two new methods, one based on binding-specific substructure comparison (TM-SITE) and another on sequence profile alignment (S-SITE), for complementary binding site predictions. The methods are tested on a set of 500 non-redundant proteins harboring 814 natural, drug-like and metal ion molecules. Starting from low-resolution protein structure predictions, the methods successfully recognize >51% of binding residues with average Matthews correlation coefficient (MCC) significantly higher (with P-value >10(-9) in student t-test) than other state-of-the-art methods, including COFACTOR, FINDSITE and ConCavity. When combining TM-SITE and S-SITE with other structure-based programs, a consensus approach (COACH) can increase MCC by 15% over the best individual predictions. COACH was examined in the recent community-wide COMEO experiment and consistently ranked as the best method in last 22 individual datasets with the Area Under the Curve score 22.5% higher than the second best method. These data demonstrate a new robust approach to protein-ligand binding site recognition, which is ready for genome-wide structure-based function annotations.
机译:动机:鉴定蛋白质-配体结合位点对于蛋白质功能注释和药物发现至关重要。但是,没有方法可以为不同的蛋白质类型生成最佳的结合位点预测。互补预测的组合可能是该问题的最可靠解决方案。结果:我们开发了两种新方法,一种基于结合特异性亚结构比较(TM-SITE),另一种基于序列谱比对(S-SITE),用于互补结合位点预测。该方法在一组500种非冗余蛋白质上进行了测试,这些蛋白质包含814种天然,类药物和金属离子分子。从低分辨率的蛋白质结构预测开始,这些方法成功地识别了> 51%的结合残基,其平均Matthews相关系数(MCC)明显高于其他状态,其中在学生t检验中,P值> 10(-9)。最先进的方法,包括COFACTOR,FINDSITE和ConCavity。将TM-SITE和S-SITE与其他基于结构的程序结合使用时,共识方法(COACH)可使MCC比最佳的个人预测高15%。在最近的整个社区COMEO实验中对COACH进行了检查,并在过去22个单独的数据集中始终被评为最佳方法,“曲线下面积”得分比第二最佳方法高22.5%。这些数据证明了一种新的鲁棒的蛋白质-配体结合位点识别方法,可用于基于全基因组结构的功能注释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号