首页> 外文期刊>Journal of Chemometrics >HIERARCHICAL MULTIBLOCK PLS AND PC MODELS FOR EASIER MODEL INTERPRETATION AND AS AN ALTERNATIVE TO VARIABLE SELECTION
【24h】

HIERARCHICAL MULTIBLOCK PLS AND PC MODELS FOR EASIER MODEL INTERPRETATION AND AS AN ALTERNATIVE TO VARIABLE SELECTION

机译:多层多块PLS和PC模型,用于更轻松的模型解释和替代变量选择

获取原文
获取原文并翻译 | 示例
       

摘要

In multivariate PLS (partial least squares projection to latent structures) and PC (principal components) models with many variables, plots and lists of b loadings, coefficients, VIPs, etc. become messy and results are difficult to interpret. There is then a strong temptation to reduce the variables to a smaller, more manageable number. This reduction of variables, however, often removes information, makes the interpretation misleading and seriously increases the risk of spurious models. A better alternative is often to divide the variables into conceptually meaningful blocks and then apply hierarchical multiblock PLS (or PC) models. This blocking leads to two model levels: the upper level where the relationships between blocks are modelled and the lower level showing the details of each block. On each level, 'standard' PLS or PC scores and loading plots are available for model interpretation. This allows an interpretation focused on pertinent blocks and their dominant variables. Such blocking is natural and straightforward in spectroscopy (multivariate calibration), quantitative molecular modelling (e.g. CoMFA) and process modelling. The principles of hierarchical multivariate PLS and PC modelling are reviewed, some problems with variable selection are discussed and the approach is illustrated for a data set with around 300 variables and 500 observations taken from a residue catalytic cracker (RCCU) at the Statoil Mongstad refinery in Norway.
机译:在具有许多变量的多元PLS(对潜在结构的局部最小二乘投影)和PC(主成分)模型中,b载荷,系数,VIP等的图和列表变得混乱,结果难以解释。因此,强烈希望将变量减少到更小,更易于管理的数量。但是,变量的减少通常会删除信息,使解释产生误导,并严重增加了伪造模型的风险。更好的选择通常是将变量分成概念上有意义的块,然后应用分层的多块PLS(或PC)模型。这种阻塞导致两个模型级别:对块之间的关系进行建模的上层级别和显示每个块详细信息的下层级别。在每个级别上,都可以使用“标准” PLS或PC分数和加载图来进行模型解释。这样就可以对相关块及其主要变量进行解释。在光谱学(多变量校准),定量分子建模(例如CoMFA)和过程建模中,这种封闭是自然而直接的。回顾了分层多元PLS和PC建模的原理,讨论了变量选择的一些问题,并对从Statoil Mongstad炼油厂的残渣催化裂化装置(RCCU)获得的约300个变量和500个观察值的数据集说明了该方法。挪威。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号