...
首页> 外文期刊>Journal of Multivariate Analysis: An International Journal >Normal distribution based pseudo ML for missing data: With applications to mean and covariance structure analysis
【24h】

Normal distribution based pseudo ML for missing data: With applications to mean and covariance structure analysis

机译:基于正态分布的伪ML用于丢失数据:应用于均值和协方差结构分析

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

When missing data are either missing completely at random (MCAR) or missing at random (MAR), the maximum likelihood (ML) estimation procedure preserves many of its properties. However, in any statistical modeling, the distribution specification for the likelihood function is at best only an approximation to the real world. In particular, since the normal-distribution-based ML is typically applied to data with heterogeneous marginal skewness and kurtosis, it is necessary to know whether such a practice still generates consistent parameter estimates. When the manifest variables are linear combinations of independent random components and missing data are MAR, this paper shows that the normal-distribution-based MLE is consistent regardless of the distribution of the sample. Examples also show that the consistency of the MLE is not guaranteed for all nonnormally distributed samples. When the population follows a confirmatory factor model, and data are missing due to the magnitude of the factors, the MLE may not be consistent even when data are normally distributed. When data are missing due to the magnitude of measurement errors/uniqueness, MLEs for many of the covariance parameters related to the missing variables are still consistent. This paper also identifies and discusses the factors that affect the asymptotic biases of the MLE when data are not missing at random. In addition, the paper also shows that, under certain data models and MAR mechanism, the MLE is asymptotically normally distributed and the asymptotic covariance matrix is consistently estimated by the commonly used sandwich-type covariance matrix. The results indicate that certain formulas and/or conclusions in the existing literature may not be entirely correct.
机译:当丢失的数据完全随机丢失(MCAR)或随机丢失(MAR)时,最大似然(ML)估计过程将保留其许多属性。但是,在任何统计模型中,似然函数的分布规范充其量仅是与现实世界的近似值。特别是,由于通常将基于正态分布的ML应用于具有不同边际偏度和峰度的数据,因此有必要知道这种做法是否仍生成一致的参数估计值。当清单变量是独立随机分量的线性组合且缺失数据是MAR时,本文表明,无论样本的分布如何,基于正态分布的MLE都是一致的。示例还显示,对于所有非正态分布的样本,均不能保证MLE的一致性。当总体遵循验证性因子模型,并且由于因子的大小而导致数据丢失时,即使数据呈正态分布,MLE也可能不一致。当由于测量误差/唯一性的大小而导致数据丢失时,与丢失变量相关的许多协方差参数的MLE仍保持一致。本文还确定并讨论了当随机数据不丢失时影响MLE渐近偏差的因素。此外,本文还表明,在某些数据模型和MAR机制下,MLE是渐近正态分布的,并且通过常用的三明治型协方差矩阵可以一致地估计渐近协方差矩阵。结果表明,现有文献中的某些公式和/或结论可能并不完全正确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号