Journal of Chemometrics

A comparison of nine PLS1 algorithms


Abstract

Nine PLS1 algorithms were evaluated, primarily in terms of their numerical stability and secondarily in terms of their speed. Six were existing algorithms: (a) NIPALS by Wold; (b) the non-orthogonalized scores algorithm by Martens; (c) Bidiag2 by Golub and Kahan; (d) SIMPLS by de Jong; (e) improved kernel PLS by Dayal; and (f) PLSF by Manne. Three new algorithms were created: (g) direct-scores PLS1, based on a new recurrent formula for the calculation of basis vectors yielding scores directly from X and y; (h) Krylov PLS1, with its regression vector defined explicitly using only the original X and y; and (i) PLSPLS1, with its regression vector recursively defined from X and the regression vectors of its previous recursions. Data from IR and NIR spectrometers applied to food, agricultural, and pharmaceutical products were used to demonstrate the numerical stability. Three methods (c, f, h) were found to create regression vectors that do not closely resemble the corresponding precise PLS1 regression vectors. Because of this, their loading and score vectors were also concluded to deviate, and their models of X and the corresponding residuals could be shown to be numerically suboptimal in a least squares sense. Methods (a, b, e, g) were the most stable. Two of them (e, g) were not only numerically stable but also much faster than methods (a, b). The fast method (d) and the moderately fast method (i) showed a tendency to become unstable at high numbers of PLS factors.
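
To make the comparison concrete, the following is a minimal NumPy sketch of two textbook PLS1 formulations: an orthogonal-scores NIPALS loop in the spirit of method (a), and a least-squares fit restricted to the Krylov subspace spanned by X'y, (X'X)X'y, (X'X)^2 X'y, and so on. It is not the code evaluated in the paper. The function names `pls1_nipals` and `pls1_krylov` are hypothetical, the Krylov version is only a generic illustration and may differ from the paper's method (h), and both assume X and y have already been mean-centered.

```python
import numpy as np


def pls1_nipals(X, y, n_factors):
    """Orthogonal-scores NIPALS PLS1 (textbook form); returns the regression vector."""
    X = np.array(X, dtype=float)  # copies, so the caller's data is not deflated
    y = np.array(y, dtype=float)
    n_vars = X.shape[1]
    W = np.zeros((n_vars, n_factors))  # weight vectors
    P = np.zeros((n_vars, n_factors))  # X loadings
    q = np.zeros(n_factors)            # y loadings
    for a in range(n_factors):
        w = X.T @ y
        w /= np.linalg.norm(w)
        t = X @ w                      # score vector for this factor
        tt = t @ t
        p = X.T @ t / tt
        q[a] = (y @ t) / tt
        X -= np.outer(t, p)            # deflate X
        y -= q[a] * t                  # deflate y
        W[:, a] = w
        P[:, a] = p
    # Regression vector in terms of the original variables: b = W (P'W)^-1 q
    return W @ np.linalg.solve(P.T @ W, q)


def pls1_krylov(X, y, n_factors):
    """Least squares restricted to a Krylov basis built from X'X and X'y.

    Illustrative only: the basis is built by repeated multiplication,
    which becomes ill-conditioned as n_factors grows.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    K = np.empty((X.shape[1], n_factors))
    v = X.T @ y
    for a in range(n_factors):
        v = v / np.linalg.norm(v)      # rescaling only; the span is unchanged
        K[:, a] = v
        v = X.T @ (X @ v)              # next Krylov direction
    coef, *_ = np.linalg.lstsq(X @ K, y, rcond=None)
    return K @ coef
```

In exact arithmetic the two functions return the same regression vector for a given number of factors, so comparing their outputs as `n_factors` increases is one simple way to observe the kind of numerical drift the abstract describes.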
