Multivariate imputation of coarsened survey data on household wealth.

机译：关于家庭财富的粗略调查数据的多元估算。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sample survey questions that attempt to measure financial variables such as household income, assets and debts are subject to high rates of missing data. To counter these high rates of missing data, survey researchers now use special questionnaire formats designed to collect interval scale or "bracketed" observations whenever a respondent is unable or unwilling to provide an exact response to a financial amount question. These special question formats significantly reduce the rate of missing data but result in a coarsened mixture of actual value responses, bracketed amounts and completely missing data. Multivariate modeling of these data is complicated by both the coarsened measurements and the fact that the distribution of each variable is a semi-continuous mixture of zeroes and continuous, non-zero amounts.A mixed normal location model is proposed for semi-continuous multivariate data. The Expectation-Maximization (EM) algorithm is developed for estimating the parameters of the mixed normal location model with coarsened data. A Bayesian Gibbs Sampler algorithm is also developed for simulating draws from the posterior predictive distribution of coarsened observations from the mixed normal location model and developing multiple imputation inferences for the parameters of this multivariate model.The Gibbs sampler algorithm for coarsened data from the mixed normal location model is used to multiply impute coarsened asset and liability values in the 1992 Health and Retirement Survey data set. The estimated distribution of household net worth based on these imputations is compared to the distributions estimated by complete case analysis and simple univariate imputation alternatives including the univariate hot deck method used to impute coarsened values in the HRS Wave 1 public use data set.The performance of the EM and Gibbs sampler algorithms is tested in a simulation study that investigates the relative bias, root mean square error, and confidence interval width and coverage properties of these methods for different degrees of coarsening of the data, ignorable and nonignorable coarsening mechanisms and departures from normality in the multivariate data model. Performance of the EM and the Gibbs sampler algorithms is compared to complete case analysis, mean imputation and univariate hot deck imputation methods.

机译：试图测量诸如家庭收入，资产和债务之类的金融变量的抽样调查问题容易导致数据丢失的可能性很高。为了应对这些高比例的丢失数据，调查研究人员现在使用特殊的问卷格式，旨在在受访者无法或不愿对财务金额问题提供准确答案时收集间隔量表或“括弧式”观察。这些特殊的问题格式显着降低了丢失数据的比率，但导致实际值响应，括号内的数量和完全丢失的数据的混合变粗。这些数据的多变量建模由于粗化的测量结果以及每个变量的分布是零和连续的非零量的半连续混合的事实而变得复杂。针对半连续的多变量数据，提出了混合正态位置模型。开发了Expectation-Maximization（EM）算法，用于估计带有粗化数据的混合法线位置模型的参数。还开发了一种贝叶斯Gibbs采样器算法，用于模拟来自混合法线位置模型的粗化观测值的后验预测分布的绘图，并为该多元模型的参数开发多个插补推断.Gibbs采样器算法用于从混合法线位置进行粗化数据该模型用于在1992年健康与退休调查数据集中将估算的粗化资产和负债值相乘。将基于这些推算得出的家庭净资产的估计分布与通过完整案例分析和简单的单变量推算替代方法（包括用于在HRS Wave 1公共用途数据集中推算粗化值的单变量热甲板方法）估算的分布进行比较。 EM和Gibbs采样器算法在模拟研究中进行了测试，该研究研究了这些方法的相对偏差，均方根误差以及置信区间宽度和覆盖范围属性，以针对数据的不同程度的粗化，可忽略和不可忽略的粗化机制以及与多元数据模型中的正态性。将EM和Gibbs采样器算法的性能与完整案例分析，均值插补和单变量热甲板插补方法进行了比较。

著录项

作者
Heeringa, Steven George.;
展开▼
作者单位

University of Michigan.;

展开▼
授予单位 University of Michigan.;
学科 Statistics.
学位 Ph.D.
年度 2000
页码 170 p.
总页数 170
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Imputation of Household Survey Data Using Linear Mixed Models [J] . Lago Luise Patricia, Clark Robert Graham Australian & New Zealand journal of statistics . 2015,第2期

机译：使用线性混合模型估算住户调查数据
2. Multiple imputation for missing data: fully conditional specification versus multivariate normal imputation. [J] . Lee KJ, Carlin JB American Journal of Epidemiology . 2010,第5期

机译：缺失数据的多重插补：完全条件指定与多元正态插补。
3. Multiple Imputation for Missing Data: Fully Conditional Specification Versus Multivariate Normal Imputation [J] . Katherine J. Lee and John B. Carlin merican Journal of Epidemiology . 2010,第5期

机译：缺失数据的多重插补：完全条件规范与多元正态插补
4. The Cleveland GPS Household Travel Survey: Survey Design, Imputation of Trip Characteristics, and Secondary Uses of the Data [C] . Jeremy Wilhelm, Jean Wolf, Eunah Kang, Annual meeting of the transportation research board;Transportation Research Board . 2014

机译：克利夫兰GPS家庭旅行调查：调查设计，旅行特征估算和数据的二次使用
5. Handling Incomplete High-Dimensional Multivariate Longitudinal Data with Mixed Data Types by Multiple Imputation Using a Longitudinal Factor Analysis Model. [D] . Lu, Xiang. 2016

机译：使用纵向因素分析模型通过多重插补处理具有混合数据类型的不完整的高维多元纵向数据。
6. On multivariate imputation and forecasting of decadal wind speed missing data [O] . Ronald Wesonga -1

机译：十年风速缺失数据的多元估算与预报
7. Multiple imputation of unordered categorical missing data: A comparison of the multivariate normal imputation and multiple imputation by chained equations [O] . Innocent Karangwa, Danelle Kotze, Renette Blignaut 2016

机译：无序分类缺失数据的多重归发：链接方程的多变量正常避难和多重归纳的比较

Multivariate imputation of coarsened survey data on household wealth.

摘要

著录项

相似文献

相关主题

期刊订阅