...
首页> 外文期刊>Statistics and computing >A comparison of methods for the fitting of generalized additive models
【24h】

A comparison of methods for the fitting of generalized additive models

机译:广义加性模型拟合方法的比较

获取原文
获取原文并翻译 | 示例

摘要

There are several procedures for fitting generalized additive models, i.e. regression models for an exponential family response where the influence of each single co-variates is assumed to have unknown, potentially non-linear shape. Simulated data are used to compare a smoothing parameter optimization approach for selection of smoothness and of covariates, a stepwise approach, a mixed model approach, and a procedure based on boosting techniques. In particular it is investigated how the performance of procedures is linked to amount of information, type of response, total number of covariates, number of influential covariates, and extent of non-linearity. Measures for comparison are prediction performance, identification of influential covariates, and smoothness of fitted functions. One result is that the mixed model approach returns sparse fits with frequently over-smoothed functions, while the functions are less smooth for the boosting approach and variable selection is less strict. The other approaches are in between with respect to these measures. The boosting procedure is seen to perform very well when little information is available and/or when a large number of covariates is to be investigated. It is somewhat surprising that in scenarios with low information the fitting of a linear model, even with stepwise variable selection, has not much advantage over the fitting of an additive model when the true underlying structure is linear. In cases with more information the prediction performance of all procedures is very similar. So, in difficult data situations the boosting approach can be recommended, in others the procedures can be chosen conditional on the aim of the analysis.
机译:有几种拟合通用加性模型的程序,即用于指数族响应的回归模型,其中假定每个单个协变量的影响都具有未知的,潜在的非线性形状。模拟数据用于比较用于选择平滑度和协变量的平滑参数优化方法,逐步方法,混合模型方法以及基于增强技术的过程。尤其要研究如何将过程的执行与信息量,响应类型,协变量总数,有影响的协变量数目以及非线性程度联系起来。比较的方法是预测性能,确定有影响的协变量以及拟合函数的平滑度。一个结果是,混合模型方法返回的稀疏拟合经常带有过度平滑的函数,而对于增强方法而言,函数不太平滑,而变量选择则不太严格。关于这些措施,其他方法介于两者之间。当几乎没有可用信息和/或要研究大量协变量时,可以看到增强过程执行得很好。令人惊讶的是,在信息量很少的情况下,即使真正的基础结构是线性的,线性模型的拟合(即使具有逐步变量选择)也比附加模型的拟合没有太多优势。在具有更多信息的情况下,所有过程的预测性能都非常相似。因此,在困难的数据情况下,可以建议采用增强方法,而在其他情况下,可以根据分析目的选择程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号