...
首页> 外文期刊>Talanta: The International Journal of Pure and Applied Analytical Chemistry >~1H NMR variable selection approaches for classification. A case study: The determination of adulterated foodstuffs
【24h】

~1H NMR variable selection approaches for classification. A case study: The determination of adulterated foodstuffs

机译:〜1 H NMR变量选择方法进行分类。案例研究:掺假食品的测定

获取原文
获取原文并翻译 | 示例

摘要

Whenever dealing with large amount of data as is the case of a NMR spectrum, carrying out a variable selection before applying a multivariate technique is necessary. This work applies various variable selection techniques to extract relevant information from ~1H NMR spectral data. Three approaches have been chosen, because each is based on very different foundations. The first method, called Xdiff, is based on calculating the normalized differences between the mean spectrum of a class considered to be the reference and the spectra of each sample. The second approach is the interval Partial Least Squares method (iPLS), which investigates the influential zones of the spectra that contains the most discriminating predictors calculating local PLS-DA models on narrow intervals. The last one is Genetic Algorithms (GAs) which finds the optimal variables from a random initial subset of variables by means of an iterative process. The performance of each variable selection strategy is determined by the classification results obtained when multiclass Partial Least Squares-Discriminant Analysis is applied. This study has been applied to NMR spectra of culinary spices that might be adulterated with banned dyes such as Sudan dyes (I-IV). The three techniques give neither the same number nor the same selected variables, but they do select a common zone from the spectra containing the most discriminating variables. All three techniques give satisfactory classification and prediction results, being higher than 95% with iPLS and GA and around 89% with Xdiff, therefore the three variable selection techniques are suitable to be used with NMR data in the determination of food adulteration with Sudan dyes as well as the specific type of adulterant used (I-IV).
机译:每当处理大量数据时(例如NMR光谱),都必须在应用多变量技术之前进行变量选择。这项工作应用了各种变量选择技术来从〜1 H NMR光谱数据中提取相关信息。选择了三种方法,因为每种方法基于非常不同的基础。第一种方法称为Xdiff,该方法基于计算被视为参考的一类平均光谱与每个样品的光谱之间的归一化差异。第二种方法是区间偏最小二乘方法(iPLS),该方法研究频谱的影响区域,该区域包含在窄区间上计算局部PLS-DA模型的最具区分性的预测变量。最后一个是遗传算法(GA),它通过迭代过程从随机的初始变量子集中找到最佳变量。每个变量选择策略的性能取决于应用多类偏最小二乘判别分析时获得的分类结果。这项研究已应用于烹饪香料的NMR光谱,这些烹饪香料可能会掺入禁用染料,例如苏丹染料(I-IV)。这三种技术既没有给出相同的数字,也没有给出相同的选定变量,但是它们确实从包含最具区分性的变量的光谱中选择了一个公共区域。三种技术均能给出令人满意的分类和预测结果,iPLS和GA分别高于95%和Xdiff约89%,因此,三种变量选择技术适用于NMR数据确定苏丹红染料的食品掺假。以及所用掺杂剂的特定类型(I-IV)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号