首页> 外文期刊>Bioinformatics >Sequence-based prediction of protein crystallization, purification and production propensity
【24h】

Sequence-based prediction of protein crystallization, purification and production propensity

机译:基于序列的蛋白质结晶,纯化和生产倾向性预测

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: X-ray crystallography-based protein structure determination, which accounts for majority of solved structures, is characterized by relatively low success rates. One solution is to build tools which support selection of targets that are more likely to crystallize. Several in silico methods that predict propensity of diffraction-quality crystallization from protein chains were developed. We show that the quality of their predictions drops when applied to more recent crystallization trails, which calls for new solutions. We propose a novel approach that alleviates drawbacks of the existing methods by using a recent dataset and improved protocol to annotate progress along the crystallization process, by predicting the success of the entire process and steps which result in the failed attempts, and by utilizing a compact and comprehensive set of sequence-derived inputs to generate accurate predictions.Results: The proposed PPCpred (predictor of protein Production, Purification and Crystallization) predict propensity for production of diffraction-quality crystals, production of crystals, purification and production of the protein material. PPCpred utilizes comprehensive set of inputs based on energy and hydrophobicity indices, composition of certain amino acid types, predicted disorder, secondary structure and solvent accessibility, and content of certain buried and exposed residues. Our method significantly outperforms alignment-based predictions and several modern crystallization propensity predictors. Receiver operating characteristic (ROC) curves show that PPCpred is particularly useful for users who desire high true positive (TP) rates, i.e. low rate of mispredictions for solvable chains. Our model reveals several intuitive factors that influence the success of individual steps and the entire crystallization process, including the content of Cys, buried His and Ser, hydrophobic/hydrophilic segments and the number of predicted disordered segments.
机译:动机:基于X射线晶体学的蛋白质结构确定(其解决的结构占大多数)的特点是成功率相对较低。一种解决方案是构建工具,以支持选择更容易确定目标的目标。开发了几种计算机方法,这些方法可预测从蛋白质链发生衍射质量结晶的倾向。我们表明,将其预测应用于最新的结晶轨迹时,其质量会下降,这需要新的解决方案。我们提出了一种新颖的方法,该方法通过使用最新的数据集和改进的协议来注释结晶过程中的进展,通过预测导致失败尝试的整个过程和步骤的成功以及通过使用紧凑的协议来缓解现有方法的弊端结果:提出的PPCpred(蛋白质生产,纯化和结晶的预测因子)预测了衍射级晶体的生产,晶体的生产,蛋白质材料的纯化和生产的倾向。 PPCpred根据能量和疏水性指数,某些氨基酸类型的组成,预测的无序性,二级结构和溶剂可及性以及某些掩埋和暴露残基的含量,利用全面的输入信息。我们的方法大大优于基于比对的预测和几种现代结晶倾向预测器。接收器工作特性(ROC)曲线表明,PPCpred对于需要较高的真实正(TP)速率(即对可解决链的错误预测率较低)的用户特别有用。我们的模型揭示了影响各个步骤和整个结晶过程成功的几个直观因素,包括Cys的含量,埋藏的His和Ser,疏水/亲水链段以及预测的无序链段的数量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号