...
首页> 外文期刊>Chemometrics and Intelligent Laboratory Systems >simrel - A versatile tool for linear model data simulation based on the concept of a relevant subspace and relevant predictors
【24h】

simrel - A versatile tool for linear model data simulation based on the concept of a relevant subspace and relevant predictors

机译:simrel-基于相关子空间和相关预测变量概念的线性模型数据仿真的多功能工具

获取原文
获取原文并翻译 | 示例
           

摘要

In the field of chemometrics and other areas of data analysis the development of new methods for statistical inference and prediction is the focus of many studies. The requirement to document the properties of new methods is inevitable, and often simulated data are used for this purpose. However, when it comes to simulating data there are few standard approaches. In this paper we propose a very transparent and versatile method for simulating response and predictor data from a multiple linear regression model which hopefully may serve as a standard tool simulating linear model data. The approach uses the principle of a relevant subspace for prediction, which is known both from Partial Least Squares and envelope models, and is essentially based on a re-parametrization of the random x regression model. The approach also allows for defining a subset of relevant observable predictor variables spanning the relevant latent subspace, which is handy for exploring methods for variable selection. The data properties are defined by a small set of input-parameters defined by the analyst. The versatile approach can be used to simulate a great variety of data with varying properties in order to compare statistical methods. The method has been implemented in an R-package and its use is illustrated by examples. (C) 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
机译:在化学计量学和其他数据分析领域,统计推断和预测新方法的开发是许多研究的重点。记录新方法的属性是不可避免的,为此经常使用模拟数据。但是,在模拟数据时,很少有标准方法。在本文中,我们提出了一种非常透明且通用的方法,用于从多元线性回归模型中模拟响应和预测变量数据,希望可以用作模拟线性模型数据的标准工具。该方法使用相关的子空间进行预测的原理,这从偏最小二乘和包络模型中都可以知道,并且基本上基于随机x回归模型的重新参数化。该方法还允许定义跨越相关潜在子空间的相关可观察预测变量的子集,这对于探索变量选择方法非常方便。数据属性由分析人员定义的一小组输入参数定义。通用方法可用于模拟各种性质各异的数据,以便比较统计方法。该方法已在R包中实现,并通过示例说明了其用法。 (C)2015作者。由Elsevier B.V.发布。这是CC BY许可下的开放获取文章(http://creativecommons.org/licenses/by/4.0/)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号