...
首页> 外文期刊>Molecules >Protein Quadratic Indices of the “Macromolecular Pseudograph’s α-Carbon Atom Adjacency Matrix”. 1. Prediction of Arc Repressor Alanine-mutant’s Stability
【24h】

Protein Quadratic Indices of the “Macromolecular Pseudograph’s α-Carbon Atom Adjacency Matrix”. 1. Prediction of Arc Repressor Alanine-mutant’s Stability

机译:“大分子伪谱仪的α-碳原子邻接矩阵”的蛋白质二次指数。 1.阻弧剂丙氨酸突变体的稳定性预测

获取原文

摘要

This report describes a new set of macromolecular descriptors of relevance to protein QSAR/QSPR studies, protein’s quadratic indices. These descriptors are calculated from the macromolecular pseudograph’s α-carbon atom adjacency matrix. A study of the protein stability effects for a complete set of alanine substitutions in Arc repressor illustrates this approach. Quantitative Structure-Stability Relationship (QSSR) models allow discriminating between near wild-type stability and reduced-stability A-mutants. A linear discriminant function gives rise to excellent discrimination between 85.4% (35/41) and 91.67% (11/12) of near wild-type stability/reduced stability mutants in training and test series, respectively. The model’s overall predictability oscillates from 80.49 until 82.93, when n varies from 2 to 10 in leave-n-out cross validation procedures. This value stabilizes around 80.49% when n was 6. Additionally, canonical regression analysis corroborates the statistical quality of the classification model (Rcanc = 0.72, p-level 0.0001). This analysis was also used to compute biological stability canonical scores for each Arc A-mutant. On the other hand, nonlinear piecewise regression model compares favorably with respect to linear regression one on predicting the melting temperature (tm) of the Arc A-mutants. The linear model explains almost 72% of the variance of the experimental tm (R = 0.85 and s = 5.64) and LOO press statistics evidenced its predictive ability (q2 = 0.55 and scv = 6.24). However, this linear regression model falls to resolve tm predictions of Arc A-mutants in external prediction series. Therefore, the use of nonlinear piecewise models was required. The tm values of A-mutants in training (R = 0.94) and test (R = 0.91) sets are calculated by piecewise model with a high degree of precision. A break-point value of 51.32 oC characterizes two mutants’ clusters and coincides perfectly with the experimental scale. For this reason, we can use the linear discriminant analysis and piecewise models in combination to classify and predict the stability of the mutants’ Arc homodimers. These models also permit the interpretation of the driving forces of such a folding process. The models include protein’s quadratic indices accounting for hydrophobic (z1), bulk-steric (z2), and electronic (z3) features of the studied molecules. Preponderance of z1 and z3 over z2 indicates the higher importance of the hydrophobic and electronic side chain terms in the folding of the Arc dimer. In this sense, developed equations involve short-reaching (k ≤ 3), middle- reaching (3 k ≤ 7) and far-reaching (k = 8 or greater) z1, 2, 3-protein’s quadratic indices. This situation points to topologic/topographic protein’s backbone interactions control of the stability profile of wild-type Arc and its A-mutants. Consequently, the present approach represents a novel and very promising way to mathematical research in biology sciences.
机译:该报告描述了与蛋白质QSAR / QSPR研究相关的一组新的大分子描述符,即蛋白质的二次指数。这些描述符是根据大分子伪图的α-碳原子邻接矩阵计算得出的。对Arc阻遏物中完整的丙氨酸取代的蛋白质稳定性影响的研究说明了这种方法。定量结构-稳定性关系(QSSR)模型可以区分近乎野生型的稳定性和降低的稳定性的A突变体。线性判别函数分别在训练和测试系列中分别产生了85.4%(35/41)和91.67%(11/12)的近野生型稳定性/降低稳定性突变体之间的出色区分。该模型的整体可预测性在80.49至82.93之间波动,而在遗漏n交叉验证程序中,当n从2变为10时。当n> 6时,该值稳定在80.49%左右。此外,规范回归分析证实了分类模型的统计质量(Rcanc = 0.72,p水平<0.0001)。该分析还用于计算每个Arc A突变体的生物稳定性标准分数。另一方面,在预测Arc A突变体的熔化温度(t m )方面,非线性分段回归模型相对于线性回归而言具有优势。线性模型解释了实验t m 的近72%的方差(R = 0.85,s = 5.64),LOO新闻统计证明了其预测能力(q 2 = 0.55和s cv = 6.24)。但是,该线性回归模型无法解决外部预测序列中的Arc A突变体的t m 个预测。因此,需要使用非线性分段模型。通过分段模型以高精确度计算出训练中的A突变体的t m 值(R = 0.94)和测试(R = 0.91)。断裂点值为51.32 o C表征了两个突变体的簇,与实验规模完全吻合。因此,我们可以结合使用线性判别分析和分段模型来分类和预测突变体的Arc同二聚体的稳定性。这些模型还可以解释这种折叠过程的驱动力。这些模型包括考虑疏水性(z 1 ),体空间(z 2 )和电子(z 3 )特征的蛋白质二次指数被研究的分子。 z 1 和z 3 在z 2 上的优势表明疏水和电子侧链术语在Arc二聚体折叠中的重要性更高。从这个意义上讲,已发展的方程式涉及短距离(k≤3),中距离(3 1,2,3 -蛋白质的二次指数。这种情况表明拓扑/地形蛋白的骨架相互作用控制了野生型弧及其A突变体的稳定性。因此,本方法代表了生物学科学中数学研究的一种新颖且非常有前途的方法。

著录项

相似文献

  • 外文文献
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号