首页> 外文期刊>Survey methodology >Model-assisted calibration of non-probability sample survey data using adaptive LASSO
【24h】

Model-assisted calibration of non-probability sample survey data using adaptive LASSO

机译:使用自适应LASSO的非概率样本调查数据的模型辅助校准

获取原文
获取原文并翻译 | 示例
       

摘要

The probability-sampling-based framework has dominated survey research because it provides precise mathematical tools to assess sampling variability. However increasing costs and declining response rates are expanding the use of non-probability samples, particularly in general population settings, where samples of individuals pulled from web surveys are becoming increasingly cheap and easy to access. But non-probability samples are at risk for selection bias due to differential access, degrees of interest, and other factors. Calibration to known statistical totals in the population provide a means of potentially diminishing the effect of selection bias in non-probability samples. Here we show that model calibration using adaptive LASSO can yield a consistent estimator of a population total as long as a subset of the true predictors is included in the prediction model, thus allowing large numbers of possible covariates to be included without risk of overfilling. We show that the model calibration using adaptive LASSO provides improved estimation with respect to mean square error relative to standard competitors such as generalized regression (GREG) estimators when a large number of covariates are required to determine the true model, with effectively no loss in efficiency over GREG when smaller models will suffice. We also derive closed form variance estimators of population totals, and compare their behavior with bootstrap estimators. We conclude with a real world example using data from the National Health Interview Survey.
机译:基于概率抽样的框架主导了调查研究,因为它提供了精确的数学工具来评估抽样的变异性。但是,成本增加和响应率下降正在扩大非概率样本的使用范围,特别是在一般人群中,从网络调查中抽取的个人样本变得越来越便宜且易于访问。但是由于差异性访问,关注程度和其他因素,非概率样本面临选择偏见的风险。对总体中已知统计总数的校准提供了一种潜在地减少非概率样本中选择偏倚影响的方法。在这里,我们表明,只要在预测模型中包含真实预测变量的子集,使用自适应LASSO的模型校准就可以得出总体总数的一致估计值,从而可以在不存在过度填充风险的情况下包含大量可能的协变量。我们显示,当需要大量协变量来确定真实模型时,使用自适应LASSO进行的模型校准相对于标准竞争对手(例如广义回归(GREG)估计器),提供了相对于均方误差的改进估计,实际上没有效率损失当较小的型号就足够时,可以超过GREG。我们还导出总体总数的闭合形式方差估计量,并将其行为与自举估计量进行比较。我们以来自“国家健康访问调查”的数据作为一个真实的例子作为结束。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号