首页> 外文学位 >To weight or to adjust: An empirical study of the design-based and model-based approaches.
【24h】

To weight or to adjust: An empirical study of the design-based and model-based approaches.

机译:权重或调整:基于设计和基于模型的方法的实证研究。

获取原文
获取原文并翻译 | 示例

摘要

When a sampling design is correlated to the dependent variable, then the distribution of the sampled units is different from that obtained from a simple random sampling design. Then the sampling design is informative, in the sense that if the design variables were not included in the analysis model, even conditional on the covariates, the estimated model parameters can be biased.;Questions have been asked about how survey data are modeled when sampling designs are informative. Two fundamental methodologies, design-based and model-based, have been proposed to address this issue. A model-based method--so-called sample distribution method, has been proposed by Krieger and Pfeffermann (1992; 1997) to extract the model of the sample data as a function of the model holding in the population and the sampling design. Once the model holding in the sample data is derived, the standard model-based analysis techniques can be applied to estimate the unknown population parameters. The core topic of this dissertation is to assess various modeling strategies and estimators of regression coefficients and their variance---both design-based and model-based, in particular, the sample distribution method, under the informative sampling design, and to develop a modeling strategy for analysts who are facing this design-based or model-based dilemma.;The dissertation is comprised of three research papers that provide (1) an evaluation of the design-based and model-based estimators under a single-stage informative sampling design; (2) an assessment of design-based and model-based estimators under an informative two-stage clustering sampling design; (3) a joint treatment of informative sampling and unit dropouts in longitudinal studies.;When a single-stage sampling design is informative, the model-based naive method---either ordinary least square or maximum likelihood, produces biased results. The design-based method reduces the amount of biases for some parameters (e.g. intercept) but increases variances, which may lead to too conservative conclusions. The sample distribution method produces better estimates in the term of having smaller biases and variances than the naive and design-based methods.;Under an informative two-stage clustering sampling design, ignoring the sampling effect, the model-based naive method produces biased results. Under some specific assumptions, the sample distribution method produces better estimators in terms of smaller biases and higher coverage rates compared to the naive method and the design-based multilevel pseudo likelihood method. Although many previous studies have shown that multilevel pseudo likelihood method is preferred to compensate for the sampling design, this study shows that a rather simpler method---the sample distribution method can be used to address the design effect.;In a specific statistical setting, the relative performance of the design-based and the model-based methods for compensating the informative sampling design and dropout has been investigated. The simulation results indicate that both the model-based and the design-based approaches generally work well in the missing at random and missing not at random settings. Moreover, the sample distribution method combined with the Diggle and Kenward model has advantages of correcting the design effect and the nonignorable dropout.
机译:当抽样设计与因变量相关时,则抽样单位的分布与从简单随机抽样设计获得的分布不同。这样的抽样设计是有益的,即如果分析模型中不包含设计变量(即使以协变量为条件),则估计的模型参数可能会产生偏差。;有人提出了有关抽样时如何对调查数据进行建模的问题设计是有益的。已经提出了两种基本方法,即基于设计和基于模型的方法来解决此问题。 Krieger和Pfeffermann(1992; 1997)提出了一种基于模型的方法-所谓的样本分发方法,以根据人口中拥有的模型和抽样设计来提取样本数据的模型。一旦得出样本数据中的模型,就可以应用基于标准模型的分析技术来估计未知的总体参数。本文的核心课题是在信息抽样设计的基础上,评估各种建模策略和回归系数及其方差的估计量-基于设计和基于模型的样本分配方法,并制定一套回归系数估计方法。面临这种基于设计或基于模型的难题的分析师的建模策略。论文由三篇研究论文组成,这些论文提供了(1)在单阶段信息采样下对基于设计和基于模型的估计量的评估设计; (2)在信息性的两阶段聚类抽样设计下,对基于设计和基于模型的估计量进行评估; (3)纵向研究中信息性抽样和单位辍学的联合处理;当单阶段抽样设计具有信息性时,基于模型的朴素方法-普通最小二乘或最大似然法会产生有偏差的结果。基于设计的方法减少了某些参数(例如截距)的偏差量,但增加了方差,这可能导致过于保守的结论。与基于朴素方法和基于设计的方法相比,样本分配方法在具有较小偏差和方差的方面产生了更好的估计。在信息丰富的两阶段聚类抽样设计中,忽略了采样效果,基于模型的朴素方法产生了有偏差的结果。 。在某些特定假设下,与朴素方法和基于设计的多级伪似然方法相比,样本分配方法在较小的偏差和较高的覆盖率方面产生了更好的估计量。尽管以前的许多研究表明,采用多层伪似然法来补偿抽样设计是可取的,但这项研究表明,可以使用一种更简单的方法-样本分配法来解决设计效果。 ,研究了基于设计和基于模型的方法的相对性能,以补偿信息性抽样设计和辍学。仿真结果表明,基于模型的方法和基于设计的方法通常都可以很好地解决随机缺失和随机缺失的问题。此外,将样本分配方法与Diggle和Kenward模型相结合,具有纠正设计效果和不可忽略遗漏的优势。

著录项

  • 作者

    Cai, Tianji.;

  • 作者单位

    The University of North Carolina at Chapel Hill.;

  • 授予单位 The University of North Carolina at Chapel Hill.;
  • 学科 Statistics.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 113 p.
  • 总页数 113
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:36:57

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号