首页> 美国卫生研究院文献>Acta Crystallographica Section D: Biological Crystallography >Using support vector machines to improve elemental ion identification in macromolecular crystal structures
【2h】

Using support vector machines to improve elemental ion identification in macromolecular crystal structures

机译:使用支持向量机改善大分子晶体结构中元素离子的识别

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

In the process of macromolecular model building, crystallographers must examine electron density for isolated atoms and differentiate sites containing structured solvent molecules from those containing elemental ions. This task requires specific knowledge of metal-binding chemistry and scattering properties and is prone to error. A method has previously been described to identify ions based on manually chosen criteria for a number of elements. Here, the use of support vector machines (SVMs) to automatically classify isolated atoms as either solvent or one of various ions is described. Two data sets of protein crystal structures, one containing manually curated structures deposited with anomalous diffraction data and another with automatically filtered, high-resolution structures, were constructed. On the manually curated data set, an SVM classifier was able to distinguish calcium from manganese, zinc, iron and nickel, as well as all five of these ions from water molecules, with a high degree of accuracy. Additionally, SVMs trained on the automatically curated set of high-resolution structures were able to successfully classify most common elemental ions in an independent validation test set. This method is readily extensible to other elemental ions and can also be used in conjunction with previous methods based on a priori expectations of the chemical environment and X-ray scattering.
机译:在建立大分子模型的过程中,结晶学家必须检查孤立原子的电子密度,并将含有结构化溶剂分子的位点与含有元素离子的位点区分开。该任务需要对金属结合化学和散射特性有特定的了解,并且容易出错。先前已经描述了一种基于针对多个元素的手动选择标准来识别离子的方法。在此,描述了使用支持向量机(SVM)将孤立的原子自动分类为溶剂或各种离子中的一种。构造了两个蛋白质晶体结构数据集,一个数据集包含沉积有异常衍射数据的手动组织结构,另一个包含自动过滤的高分辨率结构。在手动管理的数据集上,SVM分类器能够以较高的准确度将钙与锰,锌,铁和镍以及所有这五个离子与水分子区分开。此外,在自动整理的高分辨率结构集上训练的SVM能够在独立的验证测试集中成功分类最常见的元素离子。该方法易于扩展到其他元素离子,也可以基于对化学环境和X射线散射的先验期望,与以前的方法结合使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号