首页> 外文期刊>Microchemical Journal: Devoted to the Application of Microtechniques in all Branches of Science >Random forest as a potential multivariate method for near-infrared (NIR) spectroscopic analysis of complex mixture samples: Gasoline and naphtha
【24h】

Random forest as a potential multivariate method for near-infrared (NIR) spectroscopic analysis of complex mixture samples: Gasoline and naphtha

机译:随机森林作为复杂混合物样品(汽油和石脑油)近红外(NIR)光谱分析的潜在多元方法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Random forest (RF) has been demonstrated as a potential multivariate method for near-infrared (NIR) spectroscopic analysis of petroleum-driven products, highly complex mixtures of diverse hydrocarbons. For the study, a NIR dataset of gasoline samples and two separate NIR datasets of naphtha samples were prepared. These samples were carefully prepared over a long period to maximize compositional variation in each dataset. Partial least squares (PLS), the most widely adopted method in multivariate analysis, and RF were used to determine research octane numbers (RONs) of gasoline samples, and total paraffin, total naphthene and total aromatic concentrations of naphtha samples. The resulting accuracies of quantitative analysis for these samples were generally improved when RF was used. In addition, chance for overfitting of a model, which would occur occasionally in PLS modeling, was substantially lessened or possibly eliminated by the use of RF. On the contrary, in the case of RF, a calibration dataset composed of samples with narrow interval in property or concentration variation was required to improve the accuracy. Consequently, RF could be a useful multivariate method to analyze NIR as well as other spectroscopic data acquired from petroleum refining products, when properly utilized.
机译:随机森林(RF)已被证明是一种潜在的多变量方法,可用于石油驱动产品,各种烃类的高度复杂混合物的近红外(NIR)光谱分析。为了进行研究,准备了汽油样品的近红外数据集和石脑油样品的两个独立的近红外数据集。这些样本经过长期精心准备,以最大程度地提高每个数据集中的成分差异。使用偏最小二乘(PLS)(多变量分析中使用最广泛的方法)和RF来确定汽油样品的研究辛烷值(RON),以及石脑油样品的总石蜡,总环烷烃和总芳烃浓度。使用RF时,这些样品定量分析的结果准确性通常得到提高。另外,通过使用RF可以大大减少或可能消除模型过度拟合的可能性(这种情况在PLS建模中偶尔发生)。相反,在RF的情况下,需要由性质或浓度变化范围窄的样本组成的校准数据集,以提高准确性。因此,如果使用得当,RF可能是有用的多变量方法,可用于分析NIR以及从石油精炼产品中获取的其他光谱数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号