首页> 外文期刊>Chemometrics and Intelligent Laboratory Systems >Classification of gas chromatographic fingerprints of saffron using partial least squares discriminant analysis together with different variable selection methods
【24h】

Classification of gas chromatographic fingerprints of saffron using partial least squares discriminant analysis together with different variable selection methods

机译:基于偏最小二乘判别分析和不同变量选择方法的藏红花气相色谱指纹图谱分类

获取原文
获取原文并翻译 | 示例
           

摘要

In the present work, the abilities of five different variable selection methods including recursive partial least squares (rPLS), variable importance in projection (VIP), selectivity ratio (SR), significance multivariate correlation (sMC), and PLS loading weights were evaluated on the supervised classification of gas chromatographic fingerprints of saffron using PLS-discriminant analysis (PLS-DA). In this regard, eighty-three saffron samples analyzed by gas chromatography-flam ionization detector (GC-FID), were used as a case study. The GC-FID chromatograms of saffron samples were baseline corrected and aligned using asymmetric least squares (AsLS) and correlation optimized warping (COW) methods, respectively. Then, the whole digital profiles of preprocessed chromatograms were normalized to internal standard (I.S.), mean-centered, pareto-scaled and finally modeled by PLS-DA to classify saffron samples according to their cultivation areas. Afterwards, performance of different variable selection methods (i.e., rPLS, VIP, SR, sMC and loading weights) for choosing the most important variables (i.e., retention time points) in GC-FID fingerprints, were compared in terms of the model's interpretability and predictability. The results indicated that although different variable selection methods could select different subset of variables, but, prediction ability of all the models were still acceptable. The best model performance was achieved when the result of all variable selection methods were taken into account. Finally, nine secondary metabolites of saffron suggested by almost all selection methods were chosen as the saffron biomarkers.
机译:在目前的工作中,评估了五种不同的变量选择方法的能力,包括递归偏最小二乘(rPLS),变量在投影中的重要性(VIP),选择性比(SR),显着性多元相关性(sMC)和PLS加载权重使用PLS鉴别分析(PLS-DA)对藏红花的气相色谱指纹图谱进行监督分类。在这方面,通过气相色谱-火焰电离检测器(GC-FID)分析的八十三种藏红花样品被用作案例研究。藏红花样品的GC-FID色谱图分别通过不对称最小二乘(AsLS)和相关优化翘曲(COW)方法进行了基线校正和对齐。然后,将预处理色谱图的整个数字图谱归一化为内标(I.S.),以均值为中心,按比例缩放,最后通过PLS-DA建模,以根据藏红花样品的种植区域进行分类。之后,根据模型的可解释性和可比性,比较了用于选择GC-FID指纹中最重要变量(即保留时间点)的不同变量选择方法(即rPLS,VIP,SR,sMC和装载权重)的性能。可预测性。结果表明,尽管不同的变量选择方法可以选择不同的变量子集,但是所有模型的预测能力仍然可以接受。当考虑所有变量选择方法的结果时,可获得最佳模型性能。最后,选择了几乎所有选择方法建议的九种藏红花次生代谢产物作为藏红花生物标志物。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号