Journal: Proteins: Structure, Function, and Genetics

Data mining crystallization databases: knowledge-based approaches to optimize protein crystal screens.


Abstract

Protein crystallization is a major bottleneck in protein X-ray crystallography, the workhorse of most structural proteomics projects. Because the principles that govern protein crystallization are too poorly understood to allow them to be used in a strongly predictive sense, the most common crystallization strategy entails screening a wide variety of solution conditions to identify the small subset that will support crystal nucleation and growth. We tested the hypothesis that more efficient crystallization strategies could be formulated by extracting useful patterns and correlations from the large data sets of crystallization trials created in structural proteomics projects. A database of crystallization conditions was constructed for 755 different proteins purified and crystallized under uniform conditions. Forty-five percent of the proteins formed crystals. Data mining identified the conditions that crystallize the most proteins, revealed that many conditions are highly correlated in their behavior, and showed that the crystallization success rate is markedly dependent on the organism from which proteins derive. Of the proteins that crystallized in a 48-condition experiment, 60% could be crystallized in as few as 6 conditions and 94% in 24 conditions. Consideration of the full range of information coming from crystal screening trials allows one to design screens that are maximally productive while consuming minimal resources, and also suggests further useful conditions for extending existing screens.
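The abstract does not say how the reduced screens (6 or 24 conditions out of 48) were chosen. One plausible way to build such a screen is a greedy set-cover heuristic: repeatedly add the condition that crystallizes the most proteins not yet covered. The sketch below illustrates that idea only. The function name `greedy_screen`, the condition labels, and the toy hit table are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Set, Tuple


def greedy_screen(hits: Dict[str, Set[str]],
                  max_conditions: int) -> List[Tuple[str, float]]:
    """Greedy set-cover heuristic (an assumed method, not the paper's).

    hits maps a condition name to the set of proteins that crystallized in it.
    Returns the chosen conditions in order, each paired with the cumulative
    fraction of crystallizable proteins covered so far.
    """
    all_proteins: Set[str] = set().union(*hits.values())
    covered: Set[str] = set()
    remaining = dict(hits)
    chosen: List[Tuple[str, float]] = []

    while remaining and len(chosen) < max_conditions and covered != all_proteins:
        # Sort names first so ties are broken deterministically.
        best = max(sorted(remaining), key=lambda c: len(remaining[c] - covered))
        gain = remaining.pop(best) - covered
        if not gain:
            break  # no remaining condition adds a new protein
        covered |= gain
        chosen.append((best, len(covered) / len(all_proteins)))
    return chosen


# Hypothetical toy data: four conditions, five proteins.
toy_hits = {
    "cond_A": {"p1", "p2", "p3"},
    "cond_B": {"p2", "p3"},
    "cond_C": {"p4"},
    "cond_D": {"p3", "p4", "p5"},
}

for name, fraction in greedy_screen(toy_hits, max_conditions=2):
    print(f"{name}: {fraction:.0%} of crystallizable proteins covered")
```

In the toy table, `cond_B` never adds a protein that `cond_A` misses. This is the kind of redundancy the abstract points to when it notes that many conditions are highly correlated in their behavior. A greedy pass is not guaranteed to find the smallest possible screen, but it gives a quick estimate of how coverage grows as conditions are added.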
