首页> 外文学位 >High resolution force fields and residue contact prediction models for protein structure prediction.
【24h】

High resolution force fields and residue contact prediction models for protein structure prediction.

机译:用于蛋白质结构预测的高分辨率力场和残基接触预测模型。

获取原文
获取原文并翻译 | 示例

摘要

The process by which a protein acquires its stable and functional three dimensional structure is referred as the protein folding process. The understanding of protein folding process is one of the most important and challenging problems in computational biology. Various approaches to determine the three dimensional native structure of a protein from its amino acid sequence have been proposed. Some of these methods use existing experimentally-determined structures whereas, other ab initio methods do not rely on existing structures for their predictions.;Once the predictions have been made, it is very important to identify the best structure (most similar to the native structure) from an ensemble of predicted structures. Anfinsen's hypothesis states that the native structure corresponds to the global Gibbs free energy minima. A linear programming based model, that uses this hypothesis as the main criterion, has been developed to generate a Calpha-Calpha distance dependent force field. A diverse protein set and an improved decoy generation technique was employed to generate a challenging set of high quality training decoys. The Calpha-Calpha distance dependent force field generated using this model was found to be very successful in selecting native structures from an ensemble of high resolution conformers. Another linear programming based model has been developed to generate a side chain centroid distance dependent force field that includes the presence of side chain atoms of a residue. This force field was found to be more successful in discriminating between the native and non-native structures of a protein. These force fields can also be used for fold recognition and de novo protein design.;Protein structure prediction using first principles methods is very difficult and challenging because of the enormity of the conformational search space that needs to be searched. Any information that can reduce the conformational search space can potentially make the structure prediction method more efficient. In this thesis, two models have been developed to predict contacts between non-local residues of a protein. The first model uses an integer linear optimization formulation to predict non-local hydrophobic contacts of an alpha-helical protein. The second model also uses an integer linear optimization formulation to predict contacts between non-local residues of beta, alpha + beta, and alpha/beta proteins. The predicted contacts can be used to generate distance bounds between contacting residues. These bounds can prove very useful for first principles methods like ASTRO-FOLD to reduce the protein conformational search space. The problem of improving the accuracy of predicted contacts has been addressed by generating an optimal set of filters. The selection of optimal filters has been formulated as an integer linear optimization problem. The usefulness and effectiveness of proposed models for tertiary structure prediction has been tested and validated using a set of test proteins including test cases from blind protein structure prediction experiments.
机译:蛋白质获得其稳定和功能性三维结构的过程称为蛋白质折叠过程。蛋白质折叠过程的理解是计算生物学中最重要和最具挑战性的问题之一。已经提出了多种从其氨基酸序列确定蛋白质的三维天然结构的方法。这些方法中的一些使用现有的实验确定的结构,而其他从头开始的方法并不依赖于现有的结构进行预测。;一旦做出预测,识别最佳结构非常重要(最类似于天然结构) )的预测结构集合。安芬森的假设指出,天然结构与全球吉布斯自由能极小值相对应。已经开发了一个基于线性规划的模型,该模型使用该假设作为主要标准,以生成Calpha-Calpha距离相关的力场。采用了多样化的蛋白质组和改进的诱饵生成技术来生成一组具有挑战性的高质量训练诱饵。发现使用该模型生成的Calpha-Calpha距离相关力场在从高分辨率构象体集合中选择天然结构方面非常成功。已经开发了另一种基于线性规划的模型,以产生包括残基的侧链原子的存在的侧链质心距离相关力场。发现该力场在区分蛋白质的天然和非天然结构方面更为成功。这些力场也可以用于折叠识别和从头设计蛋白质。由于需要搜索的构象搜索空间巨大,因此使用第一原理方法进行蛋白质结构预测非常困难和挑战。任何可以减少构象搜索空间的信息都可能使结构预测方法更有效。在本文中,已经开发了两个模型来预测蛋白质的非局部残基之间的接触。第一个模型使用整数线性优化公式来预测α-螺旋蛋白的非局部疏水性接触。第二个模型还使用整数线性优化公式来预测β,α+β和α/β蛋白的非局部残基之间的接触。预测的接触可用于生成接触残基之间的距离范围。这些界线对于诸如ASTRO-FOLD之类的首要原理方法可以非常有用,以减少蛋白质构象搜索空间。通过产生一组最佳的滤波器,已经解决了提高预测的联系的准确性的问题。最佳滤波器的选择已公式化为整数线性优化问题。已使用一组测试蛋白(包括来自盲目的蛋白结构预测实验的测试用例)测试并验证了所提出的模型对三级结构预测的有效性和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号