首页> 外文期刊>BMC Bioinformatics >Automatic generation of 3D motifs for classification of protein binding sites
【24h】

Automatic generation of 3D motifs for classification of protein binding sites

机译:自动生成用于蛋白质结合位点分类的3D主题

获取原文
           

摘要

Background Since many of the new protein structures delivered by high-throughput processes do not have any known function, there is a need for structure-based prediction of protein function. Protein 3D structures can be clustered according to their fold or secondary structures to produce classes of some functional significance. A recent alternative has been to detect specific 3D motifs which are often associated to active sites. Unfortunately, there are very few known 3D motifs, which are usually the result of a manual process, compared to the number of sequential motifs already known. In this paper, we report a method to automatically generate 3D motifs of protein structure binding sites based on consensus atom positions and evaluate it on a set of adenine based ligands. Results Our new approach was validated by generating automatically 3D patterns for the main adenine based ligands, i.e. AMP, ADP and ATP. Out of the 18 detected patterns, only one, the ADP4 pattern, is not associated with well defined structural patterns. Moreover, most of the patterns could be classified as binding site 3D motifs. Literature research revealed that the ADP4 pattern actually corresponds to structural features which show complex evolutionary links between ligases and transferases. Therefore, all of the generated patterns prove to be meaningful. Each pattern was used to query all PDB proteins which bind either purine based or guanine based ligands, in order to evaluate the classification and annotation properties of the pattern. Overall, our 3D patterns matched 31% of proteins with adenine based ligands and 95.5% of them were classified correctly. Conclusion A new metric has been introduced allowing the classification of proteins according to the similarity of atomic environment of binding sites, and a methodology has been developed to automatically produce 3D patterns from that classification. A study of proteins binding adenine based ligands showed that these 3D patterns are not only biochemically meaningful, but can be used for protein classification and annotation.
机译:背景技术由于通过高通量方法递送的许多新蛋白质结构不具有任何已知功能,因此需要基于结构的蛋白质功能预测。蛋白质3D结构可以根据其折叠或二级结构进行聚类,以产生具有一定功能意义的类别。最近的替代方法是检测通常与活动位点相关的特定3D主题。不幸的是,与已知的顺序主题数量相比,很少有已知的3D主题,通常是手动处理的结果。在本文中,我们报告了一种基于共有原子位置自动生成蛋白质结构结合位点的3D图案的方法,并基于一组基于腺嘌呤的配体对其进行了评估。结果我们的新方法通过自动生成基于腺嘌呤的主要配体(即AMP,ADP和ATP)的3D模式进行了验证。在18种检测到的模式中,只有一种ADP4模式与定义明确的结构模式无关。此外,大多数图案都可以归类为结合位点3D图案。文献研究表明,ADP4模式实际上与显示连接酶和转移酶之间复杂的进化联系的结构特征相对应。因此,所有生成的模式都被证明是有意义的。每种模式用于查询所有结合嘌呤或鸟嘌呤配体的PDB蛋白,以评估模式的分类和注释特性。总体而言,我们的3D模式将31%的蛋白质与基于腺嘌呤的配体相匹配,其中95.5%的蛋白质被正确分类。结论引入了一种新的度量标准,可以根据结合位点原子环境的相似性对蛋白质进行分类,并且已经开发了一种从该分类自动产生3D模式的方法。对结合基于腺嘌呤的配体的蛋白质的研究表明,这些3D模式不仅在生物化学上有意义,而且可以用于蛋白质分类和注释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号