首页> 外文期刊>Journal of environmental monitoring: JEM >Evaluation of the effect of data pre-treatment procedures on classical pattern recognition and principal components analysis: a case study for the geographical classification of tea
【24h】

Evaluation of the effect of data pre-treatment procedures on classical pattern recognition and principal components analysis: a case study for the geographical classification of tea

机译:评估数据预处理程序对经典模式识别和主成分分析的影响:以茶的地理分类为例

获取原文
获取原文并翻译 | 示例
       

摘要

A simple transformation that uses the half-range and central value has been used as a data pre-treatment procedure for principal component analysis (PCA) and pattern recognition techniques. The results obtained have been compared with the results from classical normalisation of data (mean normalisation, maximum normalisation and range normalisation), autoscaling and the minimum-maximum transformation. Three data sets were used in the study. The first was formed by determining 17 elements in 53 tea samples (901 pieces of data). The second and third data sets arose from two long-term drift studies performed to examine instrumental stability at standard and robust conditions. The instruments used were an inductively coupled plasma atomic emission spectrometer and an inductively coupled plasma mass spectrometer. Each drift diagnosis experiment consisted of replicate determinations of a test solution containing 15 analytes at 10 mg 1~(-1) over 8 h without recalibration. Twenty-nine emission lines were determined 99 times, thus, each data set was formed by 2881 pieces of data. Data pre-treatment was applied to the three data sets prior to the use of principal component analysis, cluster analysis, linear discrimination analysis and soft independent modelling of class analogy. The study revealed that the half-range and central value transformation resulted in a better classification of the tea samples than that achieved using the classical normalisation. The loadings in the PCA for the long-term stability study, under both standard and robust conditions, were found to be similar to the drift trends only when the minimum-maximum transformation and the mean or maximum normalisations were used as data pre-treatments.
机译:使用半值和中心值的简单转换已用作主成分分析(PCA)和模式识别技术的数据预处理程序。将获得的结果与数据的经典归一化(均值归一化,最大归一化和范围归一化),自动缩放和最小-最大变换的结果进行了比较。该研究使用了三个数据集。第一个是通过确定53个茶样品(901条​​数据)中的17种元素形成的。第二个和第三个数据集来自两项长期漂移研究,目的是检查标准和稳健条件下的仪器稳定性。使用的仪器是电感耦合等离子体原子发射光谱仪和电感耦合等离子体质谱仪。每个漂移诊断实验都包括在8小时内重复测定包含15种分析物(浓度为10 mg 1〜(-1))的测试溶液,而无需重新校准。确定了29条发射线99次,因此,每个数据集由2881条数据组成。在使用主成分分析,聚类分析,线性判别分析和类比法软独立建模之前,对这三个数据集进行了数据预处理。研究表明,与传统归一化方法相比,半范围和中心值转换对茶样品的分类效果更好。在标准和鲁棒条件下,用于长期稳定性研究的PCA中的载荷只有在将最小-最大变换和平均或最大归一化用作数据预处理时才与漂移趋势相似。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号