首页> 外文期刊>Applied Spectroscopy >A Graphical Method to Evaluate Spectral Preprocessing in Multivariate Regression Calibrations: Example with Savitzky–Golay Filters and Partial Least Squares Regression
【24h】

A Graphical Method to Evaluate Spectral Preprocessing in Multivariate Regression Calibrations: Example with Savitzky–Golay Filters and Partial Least Squares Regression

机译:在多元回归校准中评估光谱预处理的图形方法:带有Savitzky-Golay滤波器和偏最小二乘回归的示例

获取原文
获取原文并翻译 | 示例
           

摘要

In multivariate regression analysis of spectroscopy data, spectral preprocessing is often performed to reduce unwanted background information (offsets, sloped baselines) or accentuate absorption features in intrinsically overlapping bands. These procedures, also known as pretreatments, are commonly smoothing operations or derivatives. While such operations are often useful in reducing the number of latent variables of the actual decomposition and lowering residual error, they also run the risk of misleading the practitioner into accepting calibration equations that are poorly adapted to samples outside of the calibration. The current study developed a graphical method to examine this effect on partial least squares (PLS) regression calibrations of near-infrared (NIR) reflection spectra of ground wheat meal with two analytes, protein content and sodium dodecyl sulfate sedimentation (SDS) volume (an indicator of the quantity of the gluten proteins that contribute to strong doughs). These two properties were chosen because of their differing abilities to be modeled by NIR spectroscopy: excellent for protein content, fair for SDS sedimentation volume. To further demonstrate the potential pitfalls of preprocessing, an artificial component, a randomly generated value, was included in PLS regression trials. Savitzky–Golay (digital filter) smoothing, first-derivative, and second-derivative preprocess functions (5 to 25 centrally symmetric convolution points, derived from quadratic polynomials) were applied to PLS calibrations of 1 to 15 factors. The results demonstrated the danger of an over reliance on preprocessing when (1) the number of samples used in a multivariate calibration is low (<50), (2) the spectral response of the analyte is weak, and (3) the goodness of the calibration is based on the coefficient of determination (R2) rather than a term based on residual error. The graphical method has application to the evaluation of other preprocess functions and various types of spectroscopy data.
机译:在光谱数据的多元回归分析中,通常会执行光谱预处理以减少不需要的背景信息(偏移,基线倾斜)或本质上重叠的谱带中的吸收特征。这些程序,也称为预处理,通常是平滑操作或导数。尽管这样的操作通常在减少实际分解的潜在变量的数量和降低残留误差方面很有用,但是它们也冒着误导从业人员接受不太适合于校准以外样本的校准方程式的风险。当前的研究开发了一种图形方法来检查这种影响对具有两种分析物(蛋白质含量和十二烷基硫酸钠沉淀(SDS)体积)的两种分析物的地面小麦粉近红外(NIR)反射光谱的偏最小二乘(PLS)回归校准的影响有助于形成坚硬面团的面筋蛋白数量的指标)。选择这两个属性是因为它们具有通过NIR光谱法建模的能力不同:蛋白质含量极佳,SDS沉降量中等。为了进一步说明预处理的潜在陷阱,PLS回归试验中包含了一个人工生成的随机值。将Savitzky-Golay(数字滤波器)平滑,一阶和二阶预处理函数(5至25个中心对称卷积点,从二次多项式得出)应用于1至15个因子的PLS校准。结果表明,当(1)多元校准中使用的样品数量少(<50),(2)分析物的光谱响应较弱以及(3)样品纯度过高时,过分依赖预处理存在危险。校准基于确定系数(R 2 ),而不是基于残差的项。图形方法可用于评估其他预处理功能和各种类型的光谱数据。

著录项

  • 来源
    《Applied Spectroscopy》 |2010年第1期|73-82|共10页
  • 作者

  • 作者单位
  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号