首页> 外文期刊>Statistica Sinica >A LACK-OF-FIT TEST WITH SCREENING IN SUFFICIENT DIMENSION REDUCTION
【24h】

A LACK-OF-FIT TEST WITH SCREENING IN SUFFICIENT DIMENSION REDUCTION

机译:在足够的尺寸减小中筛选缺乏拟合试验

获取原文
获取原文并翻译 | 示例
       

摘要

Researchers often need to infer how the conditional mean of a response varies with the predictors. Sufficient dimension-reduction techniques reduce the dimension by identifying a minimal set of linear combinations of the original predictors, without loss of information. This study tests whether a given small number of linear combinations of the original ultrahigh-dimensional covariates is sufficient to characterize the conditional mean of the response. We first introduce a novel consistent lack-of-fit test statistic for the case when the dimensionality of the covariates is moderate. The proposed test is shown to be n-consistent under the null hypothesis, and root-n-consistent under the alternative hypothesis. A bootstrap procedure is developed to approximate the p-values, and the consistency of the test is studied theoretically. To deal with the ultrahigh dimensionality, we introduce a two-stage lack-of-fit test with screening (LOFTS) procedure, based on a data-splitting strategy. The data are randomly partitioned into two equal halves. In the first stage, we apply the martingale difference correlation-based screening to one half of the data, and select a moderate set of covariates. In the second stage, we perform the proposed test, based on the selected covariates, using the second half of the data. The data-splitting strategy is crucial to eliminate the effect of spurious correlations and to prevent an increase in the type-I error rates. We also demonstrate the effectiveness of our two-stage test procedure by means of comprehensive simulations and a real-data application.
机译:研究人员经常需要推断出响应的条件均值如何随预测器而变化。通过识别原始预测器的最小线性组合,可以通过丢失信息来减少维度的足够的尺寸减少技术。该研究测试原始超高尺寸协变量的给定少量线性组合是否足以表征响应的条件均值。我们首先介绍一种新颖的一致缺乏适合的缺乏拟合的测试统计,因为协变量的维度为中等。所提出的试验显示在零假设下的n-一致,在替代假设下的根部N-一致。开发引导程序以近似p值,从理论上研究测试的一致性。要处理超高维度,我们根据数据分裂策略介绍了使用筛选(阁楼)程序的两级缺乏拟合测试。数据随机分为两个等分数。在第一阶段,我们将基于Martingale差异相关的屏幕应用于数据的一半,并选择一个中等的协变量。在第二阶段,我们根据所选择的协变量执行所提出的测试,使用下半部分。数据拆分策略对于消除杂散相关性的影响并防止类型-I错误率的增加至关重要。我们还通过全面模拟和实际数据应用展示了我们的两级测试程序的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号