首页> 外文期刊>Computational statistics & data analysis >Generalized additive models and inflated type i error rates of smoother significance tests
【24h】

Generalized additive models and inflated type i error rates of smoother significance tests

机译:广义加性模型和膨胀的I型错误率,进行更平滑的显着性检验

获取原文
获取原文并翻译 | 示例
       

摘要

Generalized additive models (GAMs) have distinct advantages over generalized linear models as they allow investigators to make inferences about associations between outcomes and predictors without placing parametric restrictions on the associations. The variable of interest is often smoothed using a locally weighted scatterplot smoothing (LOESS) and the optimal span (degree of smoothing) can be determined by minimizing the Akaike Information Criterion (AIC). A natural hypothesis when using GAMs is to test whether the smoothing term is necessary or if a simpler model would suffice. The statistic of interest is the difference in deviances between models including and excluding the smoothed term. As approximate chi-square tests of this hypothesis are known to be biased, permutation tests are a reasonable alternative.Wecompare the type I error rates of the chisquare test and of three permutation test methods using synthetic data generated under the null hypothesis. In each permutation method a distribution of differences in deviances is obtained from 999 permuted datasets and the null hypothesis is rejected if the observed statistic falls in the upper 5% of the distribution. One test is a conditional permutation test using the optimal span size for the observed data; this span size is held constant for all permutations. This test is shown to have an inflated type I error rate. Alternatively, the span size can be fixed a priori such that the span selection technique is not reliant on the observed data. This test is shown to be unbiased; however, the choice of span size is not clear. A third method is an unconditional permutation test where the optimal span size is selected for observed and permuted datasets. This test is unbiased though computationally intensive.
机译:广义加性模型(GAM)与广义线性模型相比具有明显的优势,因为它们允许研究人员对结果和预测变量之间的关联进行推断,而无需对关联进行参数限制。通常使用局部加权散点图平滑(LOESS)来平滑目标变量,并且可以通过最小化Akaike信息准则(AIC)来确定最佳跨度(平滑度)。使用GAM时的自然假设是测试是否需要平滑项或是否可以使用更简单的模型。感兴趣的统计量是模型之间(包括和不包括平滑项)偏差的差异。由于已知该假设的近似卡方检验存在偏差,因此排列检验是一个合理的选择。我们使用无效假设下生成的合成数据,比较卡方检验和三种排列检验方法的I型错误率。在每种排列方法中,从999个排列的数据集中获得偏差的分布,如果观察到的统计量落在分布的上5%,则拒绝零假设。一种测试是对观察数据使用最佳跨距大小的条件置换测试;对于所有排列,此跨度大小均保持恒定。该测试显示出虚高的I型错误率。可替代地,可以事先确定跨度大小,使得跨度选择技术不依赖于观察到的数据。该测试显示无偏见;但是,跨度大小的选择不清楚。第三种方法是无条件置换测试,其中为观察和置换的数据集选择最佳的跨度大小。尽管计算量大,但该测试没有偏见。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号