首页> 外文期刊>Methods: A Companion to Methods in Enzymology >Predictive models for protein crystallization.
【24h】

Predictive models for protein crystallization.

机译:蛋白质结晶的预测模型。

获取原文
获取原文并翻译 | 示例
           

摘要

Crystallization of proteins is a nontrivial task, and despite the substantial efforts in robotic automation, crystallization screening is still largely based on trial-and-error sampling of a limited subset of suitable reagents and experimental parameters. Funding of high throughput crystallography pilot projects through the NIH Protein Structure Initiative provides the opportunity to collect crystallization data in a comprehensive and statistically valid form. Data mining and machine learning algorithms thus have the potential to deliver predictive models for protein crystallization. However, the underlying complex physical reality of crystallization, combined with a generally ill-defined and sparsely populated sampling space, and inconsistent scoring and annotation make the development of predictive models non-trivial. We discuss the conceptual problems, and review strengths and limitations of current approaches towards crystallization prediction, emphasizing the importance of comprehensive and valid sampling protocols. In view of limited overlap in techniques and sampling parameters between the publicly funded high throughput crystallography initiatives, exchange of information and standardization should be encouraged, aiming to effectively integrate data mining and machine learning efforts into a comprehensive predictive framework for protein crystallization. Similar experimental design and knowledge discovery strategies should be applied to valid analysis and prediction of protein expression, solubilization, and purification, as well as crystal handling and cryo-protection.
机译:蛋白质的结晶是一项艰巨的任务,尽管在机器人自动化方面做出了巨大努力,但结晶筛选仍主要基于有限试剂和实验参数的有限子集的反复试验。通过NIH蛋白质结构计划为高通量结晶学试点项目提供资金,为以全面且统计有效的形式收集结晶数据提供了机会。因此,数据挖掘和机器学习算法具有提供蛋白质结晶预测模型的潜力。但是,潜在的复杂的物理结晶现实,再加上通常定义不清且稀疏的采样空间,以及评分和注释不一致,使得预测模型的开发变得不那么容易。我们讨论了概念上的问题,并回顾了目前进行结晶预测的方法的优势和局限性,强调了全面有效的采样协议的重要性。鉴于公共资助的高通量结晶学计划之间的技术和采样参数重叠有限,应鼓励信息交换和标准化,目的是将数据挖掘和机器学习工作有效地整合到蛋白质结晶的综合预测框架中。类似的实验设计和知识发现策略应应用于蛋白质表达,增溶和纯化以及晶体处理和冷冻保护的有效分析和预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号