首页> 美国卫生研究院文献>Molecules >Protein Quadratic Indices of the Macromolecular Pseudograph’s α-Carbon Atom Adjacency Matrix. 1. Prediction of Arc Repressor Alanine-mutant’s Stability
【2h】

Protein Quadratic Indices of the Macromolecular Pseudograph’s α-Carbon Atom Adjacency Matrix. 1. Prediction of Arc Repressor Alanine-mutant’s Stability

机译:大分子伪装者的α-碳原子邻接矩阵的蛋白质二次指数。 1.阻弧剂丙氨酸突变体的稳定性预测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This report describes a new set of macromolecular descriptors of relevance to protein QSAR/QSPR studies, protein’s quadratic indices. These descriptors are calculated from the macromolecular pseudograph’s α-carbon atom adjacency matrix. A study of the protein stability effects for a complete set of alanine substitutions in Arc repressor illustrates this approach. Quantitative Structure-Stability Relationship (QSSR) models allow discriminating between near wild-type stability and reduced-stability A-mutants. A linear discriminant function gives rise to excellent discrimination between 85.4% (35/41) and 91.67% (11/12) of near wild-type stability/reduced stability mutants in training and test series, respectively. The model’s overall predictability oscillates from 80.49 until 82.93, when n varies from 2 to 10 in leave-n-out cross validation procedures. This value stabilizes around 80.49% when n was > 6. Additionally, canonical regression analysis corroborates the statistical quality of the classification model (Rcanc = 0.72, p-level <0.0001). This analysis was also used to compute biological stability canonical scores for each Arc A-mutant. On the other hand, nonlinear piecewise regression model compares favorably with respect to linear regression one on predicting the melting temperature (tm) of the Arc A-mutants. The linear model explains almost 72% of the variance of the experimental tm (R = 0.85 and s = 5.64) and LOO press statistics evidenced its predictive ability (q2 = 0.55 and scv = 6.24). However, this linear regression model falls to resolve tm predictions of Arc A-mutants in external prediction series. Therefore, the use of nonlinear piecewise models was required. The tm values of A-mutants in training (R = 0.94) and test (R = 0.91) sets are calculated by piecewise model with a high degree of precision. A break-point value of 51.32 oC characterizes two mutants’ clusters and coincides perfectly with the experimental scale. For this reason, we can use the linear discriminant analysis and piecewise models in combination to classify and predict the stability of the mutants’ Arc homodimers. These models also permit the interpretation of the driving forces of such a folding process. The models include protein’s quadratic indices accounting for hydrophobic (z1), bulk-steric (z2), and electronic (z3) features of the studied molecules. Preponderance of z1 and z3 over z2 indicates the higher importance of the hydrophobic and electronic side chain terms in the folding of the Arc dimer. In this sense, developed equations involve short-reaching (k ≤ 3), middle- reaching (3 < k ≤ 7) and far-reaching (k = 8 or greater) z1, 2, 3-protein’s quadratic indices. This situation points to topologic/topographic protein’s backbone interactions control of the stability profile of wild-type Arc and its A-mutants. Consequently, the present approach represents a novel and very promising way to mathematical research in biology sciences.
机译:该报告描述了与蛋白质QSAR / QSPR研究相关的一组新的大分子描述符,即蛋白质的二次指数。这些描述符是根据大分子伪图的α-碳原子邻接矩阵计算得出的。对Arc阻遏物中完整的丙氨酸取代的蛋白质稳定性影响的研究说明了这种方法。定量结构-稳定性关系(QSSR)模型可以区分近乎野生型的稳定性和稳定性降低的A突变体。线性判别函数分别在训练和测试系列中分别产生了85.4%(35/41)和91.67%(11/12)的近野生型稳定性/降低稳定性突变体之间的出色区分。该模型的总体可预测性在80.49至82.93之间波动,而在遗漏n交叉验证程序中,当n从2变为10时。当n> 6时,该值稳定在80.49%左右。此外,规范回归分析证实了分类模型的统计质量(Rcanc = 0.72,p水平<0.0001)。该分析还用于计算每个Arc A突变体的生物学稳定性标准分数。另一方面,在预测Arc A突变体的熔解温度(tm)方面,非线性分段回归模型相对于线性回归具有较好的比较性。线性模型解释了实验tm的几乎72%的方差(R = 0.85和s = 5.64),LOO新闻统计证明了其预测能力(q 2 = 0.55和scv = 6.24)。但是,此线性回归模型无法解决外部预测系列中的Arc A突变体的tm预测。因此,需要使用非线性分段模型。训练(R = 0.94)和测试(R = 0.91)组中A突变体的tm值通过分段模型以高精度进行计算。断裂点值为51.32 o C表征了两个突变体的簇,与实验规模完全吻合。因此,我们可以结合使用线性判别分析和分段模型来分类和预测突变体的Arc同二聚体的稳定性。这些模型还可以解释这种折叠过程的驱动力。这些模型包括考虑到所研究分子的疏水性(z1),整体空间(z2)和电子(z3)特征的蛋白质二次指数。 z1和z3优于z2表示疏水和电子侧链术语在Arc二聚体折叠中的重要性更高。从这个意义上讲,发达的方程式涉及短距离(k≤3),中距离(3

著录项

相似文献

  • 外文文献
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号