...
首页> 外文期刊>Canadian Journal of Forest Research >On the selection of samples for multivariate regression analysis: application to near-infrared (NIR) calibration models for the prediction of pulp yield in Eucalyptus nitens.
【24h】

On the selection of samples for multivariate regression analysis: application to near-infrared (NIR) calibration models for the prediction of pulp yield in Eucalyptus nitens.

机译:关于用于多元回归分析的样本的选择:在近红外(NIR)校准模型中的应用,以预测桉树的果肉产量。

获取原文
获取原文并翻译 | 示例

摘要

The effects of using reduced calibration sets on the development of near-infrared (NIR) calibration models for the prediction of kraft pulp yield in Eucalyptus nitens trees were explored. Three selection techniques based on NIR spectral data (CADEX (computer-aided design of experiments), DUPLEX, and SELECT algorithms) and one selection method based on a measured property (RANKING algorithm) were used for analysis and compared against a model using all data. The effect of using calibration sets of different sizes was also evaluated. All sample-selection methods resulted in models of similar performance compared with the model fitted using all samples. For calibration purposes, RANKING selection resulted in models with the lowest errors of cross-validation, followed by the DUPLEX, CADEX, and SELECT methods. In terms of validation, the SELECT and CADEX methods resulted in lower errors of prediction compared with the DUPLEX and RANKING algorithms. In general, cross-validation and prediction errors decreased as the number of calibration samples increased. These results show that it is possible to obtain adequate NIR calibration models with a reduced number of samples allowing the remaining samples to be used for model validation and that sample selection based on NIR spectral data alone is as successful as selection based on a measured property.
机译:探索了减少校准集对开发近红外(NIR)校准模型以预测桉树纸浆产量的影响。使用三种基于近红外光谱数据的选择技术(CADEX(计算机辅助实验设计),DUPLEX和SELECT算法)和一种基于测量特性的选择方法(RANKING算法)进行分析,并与使用所有数据的模型进行比较。还评估了使用不同大小的校准集的效果。与使用所有样本拟合的模型相比,所有样本选择方法都产生了性能相似的模型。为了进行校准,选择RANKING会导致交叉验证误差最小的模型,其次是DUPLEX,CADEX和SELECT方法。在验证方面,与DUPLEX和RANKING算法相比,SELECT和CADEX方法的预测误差较小。通常,交叉验证和预测误差会随着校准样本数量的增加而减少。这些结果表明,可以使用减少的样本数量来获得足够的NIR校准模型,从而允许将其余样本用于模型验证,并且仅基于NIR光谱数据的样本选择与基于测量特性的选择一样成功。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号