Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment

Yang Jianyi; Roy Ambrish; Zhang Yang

首页> 外文期刊>Bioinformatics >Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment

【24h】

Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment

机译：使用互补结合特异性亚结构比较和序列图谱比对的蛋白质-配体结合位点识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Motivation: Identification of protein-ligand binding sites is critical to protein function annotation and drug discovery. However, there is no method that could generate optimal binding site prediction for different protein types. Combination of complementary predictions is probably the most reliable solution to the problem. Results: We develop two new methods, one based on binding-specific substructure comparison (TM-SITE) and another on sequence profile alignment (S-SITE), for complementary binding site predictions. The methods are tested on a set of 500 non-redundant proteins harboring 814 natural, drug-like and metal ion molecules. Starting from low-resolution protein structure predictions, the methods successfully recognize >51% of binding residues with average Matthews correlation coefficient (MCC) significantly higher (with P-value >10(-9) in student t-test) than other state-of-the-art methods, including COFACTOR, FINDSITE and ConCavity. When combining TM-SITE and S-SITE with other structure-based programs, a consensus approach (COACH) can increase MCC by 15% over the best individual predictions. COACH was examined in the recent community-wide COMEO experiment and consistently ranked as the best method in last 22 individual datasets with the Area Under the Curve score 22.5% higher than the second best method. These data demonstrate a new robust approach to protein-ligand binding site recognition, which is ready for genome-wide structure-based function annotations.

机译：动机：鉴定蛋白质-配体结合位点对于蛋白质功能注释和药物发现至关重要。但是，没有方法可以为不同的蛋白质类型生成最佳的结合位点预测。互补预测的组合可能是该问题的最可靠解决方案。结果：我们开发了两种新方法，一种基于结合特异性亚结构比较（TM-SITE），另一种基于序列谱比对（S-SITE），用于互补结合位点预测。该方法在一组500种非冗余蛋白质上进行了测试，这些蛋白质包含814种天然，类药物和金属离子分子。从低分辨率的蛋白质结构预测开始，这些方法成功地识别了> 51％的结合残基，其平均Matthews相关系数（MCC）明显高于其他状态，其中在学生t检验中，P值> 10（-9）。最先进的方法，包括COFACTOR，FINDSITE和ConCavity。将TM-SITE和S-SITE与其他基于结构的程序结合使用时，共识方法（COACH）可使MCC比最佳的个人预测高15％。在最近的整个社区COMEO实验中对COACH进行了检查，并在过去22个单独的数据集中始终被评为最佳方法，“曲线下面积”得分比第二最佳方法高22.5％。这些数据证明了一种新的鲁棒的蛋白质-配体结合位点识别方法，可用于基于全基因组结构的功能注释。

著录项

来源
《Bioinformatics》 |2013年第20期|共8页
作者
Yang Jianyi; Roy Ambrish; Zhang Yang;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物工程学（生物技术）;
关键词

相似文献

外文文献
中文文献
专利

1. Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment [J] . Yang Jianyi, Roy Ambrish, Zhang Yang Bioinformatics . 2013,第20期

机译：使用互补结合特异性亚结构比较和序列图谱比对的蛋白质-配体结合位点识别
2. Alignment-Free Ultra-High-Throughput Comparison of Druggable Protein-Ligand Binding Sites [J] . Weill N, Rognan D Journal of chemical information and modeling . 2010,第1期

机译：可比对的蛋白-配体结合位点的无比对超高通量比较
3. ATPbind: Accurate Protein–ATP Binding Site Prediction by Combining Sequence-Profiling and Structure-Based Comparisons [J] . Jun Hu, Yang Li, Yang Zhang, Journal of chemical information and modeling . 2018,第2期

机译：ATPBIND：通过组合序列分析和基于结构的比较来精确蛋白-ATP结合位点预测
4. Water participation in molecular recognition and protein-ligand association: Probing the drug binding site 'Sudlow I' in human serum albumin [C] . Najla Al-Lawatia, Thomas Steinbrecher, Osama K. Abou-Zied Reporters, markers, dyes, nanoparticles, and molecular probes for biomedical applications IV . 2012

机译：水参与分子识别和蛋白质-配体缔合：探测人血清白蛋白中的药物结合位点“ Sudlow I”
5. Development of the property encoded shape distributions and their application to protein binding site comparison and protein-ligand binding affinity prediction. [D] . Das, Sourav. 2010

机译：该属性的编码形状分布的发展及其在蛋白质结合位点比较和蛋白质-配体结合亲和力预测中的应用。
6. Protein–ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment [O] . Jianyi Yang, Ambrish Roy, Yang Zhang -1

机译：蛋白质-配体结合位点识别使用互补结合特异性亚结构比较和序列谱比对
7. Figure 4: (A) One conserved sequence, which occurs 79 times in 46,264 binding site peaks from the ChIP-seq data-set. The mutation profile of this conserved sequence is illustrated, where ’_ ’ indicates this base is unchanged; DEL indicates this base is lost; INS X indicates a new base X is inserted in front of this base. (B) Several repeated elements patterns are listed. (C) In the first column, the top five DNA motifs, mined by meme-chip tools (Machanick Bailey, 2011) are illustrated. The resemblant conserved sequences, found by the CFSP algorithm are listed in the second column. In the third column, the position-specific scoring matrices, which are transformed from mutational information are listed. The similarity between meme motif and resemblant conserved sequence with PSSM format was calculated via a stamp motif comparison tool (Mahony Benos, 2007). The E-values for the similarity of those pairs is displayed in the fourth column. (D) One motif is selected in each group clustered by gkmsvm descriptors, and the corresponding motif found by the CFSP algorithm is listed below. (E) There are additional datasets (File No: ENCFF100GRL, ENCFF616IRT, ENCFF870CER, Target: SREBF1) collected from https://www.encodeproject.org. The top two motifs are selected in each file using meme tools, and the corresponding motifs found by our algorithm are listed below. [O] . -1

机译：图4：（a）一种保守序列，其发生在芯片-SEQ数据集中的46,264个结合位点峰值中的79倍。说明了这种保守序列的突变分布，其中'_'表示该碱度不变; del表示此基础丢失; INS X表示新的基础X插入此基础前面。（b）列出了几种重复的元素模式。（c）在第一栏中，示出了由MEME芯片工具（Machanick＆Bailey，2011）开采的前五个DNA主题。由CFSP算法发现的相应保守序列列于第二列中。在第三列中，列出了从突变信息转换的特定位置的评分矩阵。 MEME主题与PSSM格式的相似性与PSSM格式之间的相似性通过邮票图章比较工具（Mahony＆Benos，2007）计算。这些对相似性的电子值显示在第四列中。（d）在由GKMSVM描述符聚集的每个组中选择了一个图案，下面列出了CFSP算法的相应主题。（e）从https://www.encodeproject.org收集的，有附加数据集（文件no：cernff100grl，cenf616irl，conf8.20cer，target：srebf1）。使用MEME工具在每个文件中选择前两个图案，并且我们的算法发现的相应主题如下所示。

Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment

摘要

著录项

相似文献

相关主题

期刊订阅