...
首页> 外文期刊>Journal of Molecular Biology >A 3D-1D SUBSTITUTION MATRIX FOR PROTEIN FOLD RECOGNITION THAT INCLUDES PREDICTED SECONDARY STRUCTURE OF THE SEQUENCE
【24h】

A 3D-1D SUBSTITUTION MATRIX FOR PROTEIN FOLD RECOGNITION THAT INCLUDES PREDICTED SECONDARY STRUCTURE OF THE SEQUENCE

机译:用于蛋白质折叠识别的3D-1D替换矩阵,包括预期的序列二级结构

获取原文
获取原文并翻译 | 示例

摘要

In protein fold recognition, a probe amino acid sequence is compared to a Library of representative folds of known structure to identify a structural homolog. In cases where the probe and its homolog have clear sequence similarity, traditional residue substitution matrices have been used to predict the structural similarity. In cases where the probe is sequentially distant from its homolog, we have developed a (7 x 3 x 2 x 7 x 3) 3D-1D substitution matrix (called H3P2), calculated from a database of 119 structural pairs. Members of each pair share a similar fold, but have sequence identity less than 30%. Each probe sequence position is defined by one of seven residue classes and three secondary structure classes. Each homologous fold position is defined by one of seven residue classes, three secondary structure classes, and two burial classes. Thus the matrix is five-dimensional and contains 7 x 3 x 2 x 7 x 3 = 882 elements or 3D-1D scores. The first step in assigning a probe sequence to its homologous fold is the prediction of the three-state (helix, strand, coil) secondary structure of the probe; here we use the profile based neural network prediction of secondary structure (PHD) program. Then a dynamic programming algorithm uses the H3P2 matrix to align the probe sequence with structures in a representative fold library. To test the effectiveness of the H3P2 matrix a challenging, fold class diverse, and cross-validated benchmark assessment is used to compare the H3P2 matrix to the GONNET, PAM250, BLOSUM62 and a secondary structure only substitution matrix. For distantly related sequences the H3P2 matrix detects more homologous structures at higher reliabilities than do these other substitution matrices, based on sensitivity versus specificity plots (or SENS-SPEC plots). The added efficacy of the H3P2 matrix arises from its information on the statistical preferences for various sequence-structure environment combinations from very distantly related proteins. It introduces the predicted secondary structure information from a sequence into fold recognition in a statistical way that normalizes the inherent correlations between residue type, secondary structure and solvent accessibility. (C) 1997 Academic Press Limited. [References: 49]
机译:在蛋白质折叠识别中,将探针氨基酸序列与已知结构的代表性折叠文库进行比较,以鉴定结构同源物。在探针及其同源物具有明确的序列相似性的情况下,传统的残基取代矩阵已用于预测结构相似性。如果探针与同源序列相距较远,我们开发了(7 x 3 x 2 x 7 x 3)3D-1D取代矩阵(称为H3P2),该矩阵是根据119个结构对的数据库计算得出的。每对成员共享相似的折叠,但序列同一性小于30%。每个探针序列位置由七个残基类别和三个二级结构类别之一定义。每个同源折叠位置由七个残基类别,三个二级结构类别和两个埋葬类别之一定义。因此,矩阵是五维的,包含7 x 3 x 2 x 7 x 3 = 882个元素或3D-1D分数。将探针序列赋予其同源折叠的第一步是预测探针的三态(螺旋,链,螺旋)二级结构。在这里,我们使用基于轮廓的神经网络预测二级结构(PHD)程序。然后,动态编程算法使用H3P2矩阵将探针序列与代表性折叠文库中的结构进行比对。为了测试H3P2矩阵的有效性,使用了具有挑战性,折叠级别多样化且经过交叉验证的基准评估,以将H3P2矩阵与GONNET,PAM250,BLOSUM62和仅用于二级结构的替换矩阵进行比较。对于远距离相关的序列,H3P2矩阵基于灵敏度与特异性的关系图(或SENS-SPEC图),以比其他替代矩阵更高的可靠性检测更多的同源结构。 H3P2矩阵的附加功效源于其有关与非常遥远相关的蛋白质的各种序列结构环境组合的统计偏好有关的信息。它以统计方式将序列中预测的二级结构信息引入折叠识别中,以标准化残基类型,二级结构和溶剂可及性之间的内在联系。 (C)1997 Academic Press Limited。 [参考:49]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号