首页> 外文学位 >Multivariate imputation of coarsened survey data on household wealth.
【24h】

Multivariate imputation of coarsened survey data on household wealth.

机译:关于家庭财富的粗略调查数据的多元估算。

获取原文
获取原文并翻译 | 示例

摘要

Sample survey questions that attempt to measure financial variables such as household income, assets and debts are subject to high rates of missing data. To counter these high rates of missing data, survey researchers now use special questionnaire formats designed to collect interval scale or "bracketed" observations whenever a respondent is unable or unwilling to provide an exact response to a financial amount question. These special question formats significantly reduce the rate of missing data but result in a coarsened mixture of actual value responses, bracketed amounts and completely missing data. Multivariate modeling of these data is complicated by both the coarsened measurements and the fact that the distribution of each variable is a semi-continuous mixture of zeroes and continuous, non-zero amounts.A mixed normal location model is proposed for semi-continuous multivariate data. The Expectation-Maximization (EM) algorithm is developed for estimating the parameters of the mixed normal location model with coarsened data. A Bayesian Gibbs Sampler algorithm is also developed for simulating draws from the posterior predictive distribution of coarsened observations from the mixed normal location model and developing multiple imputation inferences for the parameters of this multivariate model.The Gibbs sampler algorithm for coarsened data from the mixed normal location model is used to multiply impute coarsened asset and liability values in the 1992 Health and Retirement Survey data set. The estimated distribution of household net worth based on these imputations is compared to the distributions estimated by complete case analysis and simple univariate imputation alternatives including the univariate hot deck method used to impute coarsened values in the HRS Wave 1 public use data set.The performance of the EM and Gibbs sampler algorithms is tested in a simulation study that investigates the relative bias, root mean square error, and confidence interval width and coverage properties of these methods for different degrees of coarsening of the data, ignorable and nonignorable coarsening mechanisms and departures from normality in the multivariate data model. Performance of the EM and the Gibbs sampler algorithms is compared to complete case analysis, mean imputation and univariate hot deck imputation methods.
机译:试图测量诸如家庭收入,资产和债务之类的金融变量的抽样调查问题容易导致数据丢失的可能性很高。为了应对这些高比例的丢失数据,调查研究人员现在使用特殊的问卷格式,旨在在受访者无法或不愿对财务金额问题提供准确答案时收集间隔量表或“括弧式”观察。这些特殊的问题格式显着降低了丢失数据的比率,但导致实际值响应,括号内的数量和完全丢失的数据的混合变粗。这些数据的多变量建模由于粗化的测量结果以及每个变量的分布是零和连续的非零量的半连续混合的事实而变得复杂。针对半连续的多变量数据,提出了混合正态位置模型。开发了Expectation-Maximization(EM)算法,用于估计带有粗化数据的混合法线位置模型的参数。还开发了一种贝叶斯Gibbs采样器算法,用于模拟来自混合法线位置模型的粗化观测值的后验预测分布的绘图,并为该多元模型的参数开发多个插补推断.Gibbs采样器算法用于从混合法线位置进行粗化数据该模型用于在1992年健康与退休调查数据集中将估算的粗化资产和负债值相乘。将基于这些推算得出的家庭净资产的估计分布与通过完整案例分析和简单的单变量推算替代方法(包括用于在HRS Wave 1公共用途数据集中推算粗化值的单变量热甲板方法)估算的分布进行比较。 EM和Gibbs采样器算法在模拟研究中进行了测试,该研究研究了这些方法的相对偏差,均方根误差以及置信区间宽度和覆盖范围属性,以针对数据的不同程度的粗化,可忽略和不可忽略的粗化机制以及与多元数据模型中的正态性。将EM和Gibbs采样器算法的性能与完整案例分析,均值插补和单变量热甲板插补方法进行了比较。

著录项

  • 作者

    Heeringa, Steven George.;

  • 作者单位

    University of Michigan.;

  • 授予单位 University of Michigan.;
  • 学科 Statistics.
  • 学位 Ph.D.
  • 年度 2000
  • 页码 170 p.
  • 总页数 170
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号