...
首页> 外文期刊>Applied Geochemistry: Journal of the International Association of Geochemistry and Cosmochemistry >Factor analysis applied to regional geochemical data: problems and possibilities
【24h】

Factor analysis applied to regional geochemical data: problems and possibilities

机译:因子分析应用于区域地球化学数据:问题和可能性

获取原文
获取原文并翻译 | 示例
           

摘要

A large regional geochemical data set of C-horizon podzol samples from a 188,000 km(2) area in the European Arctic, analysed for more than 50 elements, was used to test the influence of different variants of factor analysis on the results extracted. Due to the nature of regional geochemical data (neither normal nor log-normal, strongly skewed, often multimodal data distributions), the simplest methods of factor analysis with the least statistical assumptions perform best. As a result of this test it can generally be suggested to use principal factor analysis with an orthogonal rotation for such data. Selecting the number of factors to extract is difficult, however, the scree plot provides some useful help. For the test data, a low number of extracted factors gave the most informative results. Deleting or adding just 1 element in the input matrix can drastically change the results of factor analysis. Given that selection of elements is often rather based on availability of analytical packages (or detection limits) than on geochemical reasoning this is a disturbing result. Factor analysis revealed the most interesting data structures when a low number of variables were entered. A graphical presentation of the loadings and a simple, automated mapping technique allows extraction of the most interesting results of different factor analyses in one glance. Results presented here underline the importance of careful univariate data analysis prior to entering factor analysis. Outliers should be removed from the dataset and different populations present in the data should be treated separately. Factor analysis can be used to explore a large data set for hidden multivariate data structures. (C) 2002 Elsevier Science Ltd. All rights reserved. [References: 53]
机译:来自欧洲北极地区188,000 km(2)区域的C-水平podzol样本的大型区域地球化学数据集,分析了50多种元素,用于测试因子分析的不同变体对提取结果的影响。由于区域地球化学数据的性质(既不是正态也不是对数正态,偏度很大,通常是多峰数据分布),最简单的因子分析方法(具有最少的统计假设)表现最佳。作为该测试的结果,通常建议对此类数据使用正交旋转的主因子分析。选择要提取的因子数量很困难,但是,卵石图提供了一些有用的帮助。对于测试数据,少量提取的因子给出的信息最多。在输入矩阵中删除或仅添加1个元素会大大改变因子分析的结果。鉴于元素的选择通常是基于分析包的可用性(或检测极限),而不是基于地球化学推理,这是一个令人不安的结果。当输入少量变量时,因子分析显示出最有趣的数据结构。负载的图形化显示和简单的自动映射技术允许一目了然地提取不同因子分析的最有趣结果。本文介绍的结果强调了在进行因子分析之前仔细进行单变量数据分析的重要性。离群值应从数据集中删除,数据中存在的不同种群应分开对待。因子分析可用于探索隐藏的多元数据结构的大型数据集。 (C)2002 Elsevier ScienceLtd。保留所有权利。 [参考:53]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号