首页> 美国卫生研究院文献>Biophysical Journal >Machine Learning Methods for X-Ray Scattering Data Analysis from Biomacromolecular Solutions
【2h】

Machine Learning Methods for X-Ray Scattering Data Analysis from Biomacromolecular Solutions

机译:用于生物大分子溶液的X射线散射数据分析的机器学习方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Small-angle x-ray scattering (SAXS) of biological macromolecules in solutions is a widely employed method in structural biology. SAXS patterns include information about the overall shape and low-resolution structure of dissolved particles. Here, we describe how to transform experimental SAXS patterns to feature vectors and how a simple k-nearest neighbor approach is able to retrieve information on overall particle shape and maximal diameter (Dmax) as well as molecular mass directly from experimental scattering data. Based on this transformation, we develop a rapid multiclass shape-classification ranging from compact, extended, and flat categories to hollow and random-chain-like objects. This classification may be employed, e.g., as a decision block in automated data analysis pipelines. Further, we map protein structures from the Protein Data Bank into the classification space and, in a second step, use this mapping as a data source to obtain accurate estimates for the structural parameters (Dmax, molecular mass) of the macromolecule under study based on the experimental scattering pattern alone, without inverse Fourier transform for Dmax. All methods presented are implemented in a Fortran binary DATCLASS, part of the ATSAS data analysis suite, available on Linux, Mac, and Windows and free for academic use.
机译:溶液中生物大分子的小角X射线散射(SAXS)是结构生物学中广泛使用的方法。 SAXS模式包括有关溶解粒子的整体形状和低分辨率结构的信息。在这里,我们描述了如何将实验SAXS模式转换为特征向量,以及简单的k近邻方法如何直接从实验散射数据中检索有关总体粒子形状和最大直径(Dmax)以及分子质量的信息。基于此转换,我们开发了快速的多类形状分类,从紧凑,扩展和平面类别到空心和类似随机链的对象。该分类可以例如用作自动化数据分析管线中的决策块。此外,我们将蛋白质数据库中的蛋白质结构映射到分类空间中,并在第二步中,使用该映射作为数据源,以基于以下数据获得对正在研究的大分子的结构参数(Dmax,分子质量)的准确估算单独的实验散射图,没有Dmax的傅立叶逆变换。所介绍的所有方法均在Fortran二进制DATCLASS中进行实现,该DATCLASS是ATSAS数据分析套件的一部分,可在Linux,Mac和Windows上使用,并且可免费用于学术领域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号