...
首页> 外文期刊>Medical science monitor : >A tabular approach to the sequence-to-structure relation in proteins (tetrapeptide representation) for de novo protein design.
【24h】

A tabular approach to the sequence-to-structure relation in proteins (tetrapeptide representation) for de novo protein design.

机译:一种用于从头进行蛋白质设计的蛋白质序列与结构关系的表格方法(四肽表示)。

获取原文
   

获取外文期刊封面封底 >>

       

摘要

BACKGROUND: Experimental observations classify the protein-folding process as a multi-step event. The backbone conformation has been experimentally recognized as responsible for the early-stage structural forms of a polypeptide. The sequence-to-structure and structure-to-sequence relation is critical for predicting protein structure. A contingency table representing this relation for tetrapeptides in their early-stage is presented. Their correlation seems to be essential in protein-folding simulation. MATERIAL/METHODS: The polypeptide chains of all the proteins in the Protein Data Bank were transformed into their early-stage structural forms. The tetrapeptide was selected as the structural unit. Tetrapetide sequences and structures were expressed by letter codes. The transformation of a contingency table of any size (here: 160,000x2401) to a 2x2 table performed for each non-zero cell of the original table allowed calculation of the rho-coefficient measuring the strength of the relation. RESULTS: High values of the rho-coefficient extracted sequences of strong structural determinability and structures of high sequence selectivity. The web-site program to calculate the rho-coefficient ranking list was constructed to enable applying this method to any problem of contingency table analysis. CONCLUSIONS: The results revealed sequence-to-structure (and vice versa) correlation in early-stage folding. Surprisingly, the irregular structural forms of loops and bends appeared to be highly determined. Comparison of these results with another method based on information entropy revealed high accordance. The method oriented on interpretation of a large contingency table seems very useful especially for large-scale microarray analysis, a very popular technique in the post-genomic era.
机译:背景:实验观察将蛋白质折叠过程归类为多步骤事件。骨架构象已被实验认为是造成多肽早期结构形式的原因。序列与结构和结构与序列的关系对于预测蛋白质结构至关重要。列出了代表四肽在其早期阶段的这种关系的列联表。它们的相关性似乎在蛋白质折叠模拟中至关重要。材料/方法:蛋白质数据库中所有蛋白质的多肽链均已转化为其早期结构形式。选择四肽作为结构单元。四肽序列和结构用字母代码表示。对原始表的每个非零像元执行任何大小的列联表(此处为:160,000x2401)到2x2表的转换,都可以计算出测量该关系强度的rho系数。结果:高值的rho系数提取的序列具有很强的结构可确定性和高序列选择性的结构。构建了用于计算rho系数排名列表的网站程序,以将这种方法应用于任何列联表分析问题。结论:结果揭示了早期折叠中序列与结构的相关性(反之亦然)。令人惊讶的是,环状和弯曲的不规则结构形式似乎是高度确定的。这些结果与另一种基于信息熵的方法的比较显示出高度的一致性。面向大列联表的解释的方法似乎非常有用,特别是对于大规模微阵列分析,这是后基因组时代的一种非常流行的技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号