Classifying and analyzing small-angle scattering data using weighted k nearest neighbors machine learning techniques

Archibald Richard K.; Doucet Mathieu; Johnston Travis; Young Steven R.; Yang Erika; Heller William T.

首页> 外文期刊>Journal of Applied Crystallography >Classifying and analyzing small-angle scattering data using weighted k nearest neighbors machine learning techniques

【24h】

Classifying and analyzing small-angle scattering data using weighted k nearest neighbors machine learning techniques

机译：使用加权k最近邻居机学习技术进行分类和分析小角度散射数据

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A consistent challenge for both new and expert practitioners of small-angle scattering (SAS) lies in determining how to analyze the data, given the limited information content of said data and the large number of models that can be employed. Machine learning (ML) methods are powerful tools for classifying data that have found diverse applications in many fields of science. Here, ML methods are applied to the problem of classifying SAS data for the most appropriate model to use for data analysis. The approach employed is built around the method of weighted k nearest neighbors (wKNN), and utilizes a subset of the models implemented in the SasView package (https://www. sasview.org/) for generating a well defined set of training and testing data. The prediction rate of the wKNN method implemented here using a subset of SasView models is reasonably good for many of the models, but has difficulty with others, notably those based on spherical structures. A novel expansion of the wKNN method was also developed, which uses Gaussian processes to produce local surrogate models for the classification, and this significantly improves the classification accuracy. Further, by integrating a stochastic gradient descent method during post-processing, it is possible to leverage the local surrogate model both to classify the SAS data with high accuracy and to predict the structural parameters that best describe the data. The linking of data classification and model fitting has the potential to facilitate the translation of measured data into results for both novice and expert practitioners of SAS.

机译：对于小角度散射（SAS）的新的和专家从业者的一致挑战在于确定如何分析数据，给出了所述数据的有限信息内容和可以采用的大量模型。机器学习（ML）方法是用于分类数据在许多科学领域中发现不同应用的数据的强大工具。这里，将ML方法应用于对用于数据分析的最合适模型进行分类SAS数据的问题。围绕加权K最近邻居（WKNN）的方法构建了所采用的方法，并利用SASVIEW包中实现的模型的子集（https：// www.cw.frow。sasview.org/），用于生成定义的一组培训和测试数据。使用SASView型号的子集实现的WKNN方法的预测率适用于许多模型，但是与他人难以困难，特别是基于球面结构的型号。还开发了一种新颖的WKNN方法扩展，它使用高斯工艺为分类产生局部代理模型，这显着提高了分类精度。此外，通过在后处理期间集成随机梯度下降方法，可以利用本地代理模型，以便高精度地将SAS数据分类，并预测最能描述数据的结构参数。数据分类和模型拟合的链接有可能促进测量数据的转换为SA的新手和专家从业者的结果。

著录项

来源
《Journal of Applied Crystallography》 |2020年第2期|共9页
作者
Archibald Richard K.; Doucet Mathieu; Johnston Travis; Young Steven R.; Yang Erika; Heller William T.;
展开▼
作者单位

Oak Ridge Natl Lab Comp Sci &

Math Div POB 2009 Oak Ridge TN 37831 USA;

Oak Ridge Natl Lab Neutron Scattering Div POB 2009 Oak Ridge TN 37831 USA;

Oak Ridge Natl Lab Comp Sci &

Math Div POB 2009 Oak Ridge TN 37831 USA;

Oak Ridge Natl Lab Comp Sci &

Math Div POB 2009 Oak Ridge TN 37831 USA;

Oak Ridge Natl Lab Comp Sci &

Math Div POB 2009 Oak Ridge TN 37831 USA;

Oak Ridge Natl Lab Neutron Scattering Div POB 2009 Oak Ridge TN 37831 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类应用晶体学;晶体学;
关键词
small-angle scattering data; machine learning; modeling; SasView;

机译：小角度散射数据;机器学习;建模;SASVIEW;

相似文献

外文文献
中文文献
专利

1. Classifying and analyzing small-angle scattering data using weighted k nearest neighbors machine learning techniques [J] . Archibald Richard K., Doucet Mathieu, Johnston Travis, Journal of Applied Crystallography . 2020,第2期

机译：使用加权k最近邻居机学习技术进行分类和分析小角度散射数据
2. Learning k-Nearest Neighbors Classifier from Distributed Data [J] . Khedr, Ahmed M. Computing and informatics . 2012,第3期

机译：从分布式数据中学习k最近邻分类器
3. LEARNING K-NEAREST NEIGHBORS CLASSIFIER FROM DISTRIBUTED DATA [J] . Ahmed M. Khedr Computing and informatics . 2008,第3期

机译：从分布式数据中学习K-近邻分类器
4. Improving Accuracy for Classifying Selected Medical Datasets with Weighted Nearest Neighbors and Fuzzy Nearest Neighbors Algorithms [C] . Qasem Monzer, Nour Mohamed 2015 International Conference on Cloud Computing . 2015

机译：使用加权最近邻和模糊最近邻算法提高对选定医学数据集进行分类的准确性
5. Weighted K-Nearest Neighbor Algorithm as an Object Localization technique using Passive RFID Tags. [D] . Shetty, Akshay. 2010

机译：加权K最近邻算法作为使用无源RFID标签的对象定位技术。
6. Remaining Useful Life Estimation of Insulated Gate Biploar Transistors (IGBTs) Based on a Novel Volterra k-Nearest Neighbor Optimally Pruned Extreme Learning Machine (VKOPP) Model Using Degradation Data [O] . Zhen Liu, Wenjuan Mei, Xianping Zeng, 2017

机译：基于新型Volterra k最近邻最优修剪极限学习机（VKOPP）模型的绝缘栅双极晶体管（IGBT）剩余寿命估算
7. Figure 7: Box-plots with the weighted accuracy and F1 score for the theoretical vs. theoretical classification of rotamers and families of rotamers, using Nearest Neighbor (NN), Decision Tree (DT), Random Forest (RF), Multi-Layer Perceptron (MLP) and Support Vector Machine (SVM) classifiers. [O] . -1

机译：图7：具有加权准确性和F1的箱子图，用于理论与旋转仪和旋转仪系列的理论分类，使用最近的邻居（NN），决策树（DT），随机森林（RF），多层植物（MLP）和支持向量机（SVM）分类器。

Classifying and analyzing small-angle scattering data using weighted k nearest neighbors machine learning techniques

摘要

著录项

相似文献

相关主题

期刊订阅