首页> 外文期刊>Journal of Applied Crystallography >Classifying and analyzing small-angle scattering data using weighted k nearest neighbors machine learning techniques
【24h】

Classifying and analyzing small-angle scattering data using weighted k nearest neighbors machine learning techniques

机译:使用加权k最近邻居机学习技术进行分类和分析小角度散射数据

获取原文
获取原文并翻译 | 示例
           

摘要

A consistent challenge for both new and expert practitioners of small-angle scattering (SAS) lies in determining how to analyze the data, given the limited information content of said data and the large number of models that can be employed. Machine learning (ML) methods are powerful tools for classifying data that have found diverse applications in many fields of science. Here, ML methods are applied to the problem of classifying SAS data for the most appropriate model to use for data analysis. The approach employed is built around the method of weighted k nearest neighbors (wKNN), and utilizes a subset of the models implemented in the SasView package (https://www. sasview.org/) for generating a well defined set of training and testing data. The prediction rate of the wKNN method implemented here using a subset of SasView models is reasonably good for many of the models, but has difficulty with others, notably those based on spherical structures. A novel expansion of the wKNN method was also developed, which uses Gaussian processes to produce local surrogate models for the classification, and this significantly improves the classification accuracy. Further, by integrating a stochastic gradient descent method during post-processing, it is possible to leverage the local surrogate model both to classify the SAS data with high accuracy and to predict the structural parameters that best describe the data. The linking of data classification and model fitting has the potential to facilitate the translation of measured data into results for both novice and expert practitioners of SAS.
机译:对于小角度散射(SAS)的新的和专家从业者的一致挑战在于确定如何分析数据,给出了所述数据的有限信息内容和可以采用的大量模型。机器学习(ML)方法是用于分类数据在许多科学领域中发现不同应用的数据的强大工具。这里,将ML方法应用于对用于数据分析的最合适模型进行分类SAS数据的问题。围绕加权K最近邻居(WKNN)的方法构建了所采用的方法,并利用SASVIEW包中实现的模型的子集(https:// www.cw.frow。sasview.org/),用于生成定义的一组培训和测试数据。使用SASView型号的子集实现的WKNN方法的预测率适用于许多模型,但是与他人难以困难,特别是基于球面结构的型号。还开发了一种新颖的WKNN方法扩展,它使用高斯工艺为分类产生局部代理模型,这显着提高了分类精度。此外,通过在后处理期间集成随机梯度下降方法,可以利用本地代理模型,以便高精度地将SAS数据分类,并预测最能描述数据的结构参数。数据分类和模型拟合的链接有可能促进测量数据的转换为SA的新手和专家从业者的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号