首页> 外文期刊>Statistics and Its Interface >Nonparametric statistical inference and imputation for incomplete categorical data
【24h】

Nonparametric statistical inference and imputation for incomplete categorical data

机译:不完全分类数据的非参数统计推断和估算

获取原文
获取原文并翻译 | 示例
           

摘要

Missingness in categorical data is a common problem in various real applications. Traditional approaches either utilize only the complete observations or impute the missing data by some ad hoc methods rather than the true conditional distribution of the missing data, thus losing or distorting the rich information in the partial observations. In this paper, we propose the Dirichlet Process Mixture of Collapsed Product-Multinomials (DPMCPM) to model the full data jointly and compute the model efficiently. By fitting an infinite mixture of product-multinomial distributions, DPMCPM is applicable for any categorical data regardless of the true distribution, which may contain complex association among variables. Under the framework of latent class analysis, we show that DPMCPM can model general missing mechanisms by creating an extra category to denote missingness, which implicitly integrates out the missing part with regard to their true conditional distribution. Through simulation studies and a real application, we demonstrate that DPMCPM outperforms existing approaches on statistical inference and imputation for incomplete categorical data of various missing mechanisms. DPMCPM is implemented as the R package MMDai, which is available from the Comprehensive R Archive Network at https://cran.r-project.org/ web/packages/MMDai/index.html.
机译:分类数据中的遗失是各种真实应用中的常见问题。传统方法仅利用完整的观察或通过某些ad hoc方法赋予丢失的数据,而不是缺少数据的真实条件分布,从而在部分观测中丢失或扭曲丰富的信息。在本文中,我们提出了折叠产品 - 多人(DPMCPM)的Dirichlet工艺混合物来共同建模完整数据并有效地计算模型。通过拟合产品 - 多项分布的无限混合物,DPMCPM适用于任何分类数据,无论真正的分布如何,都可能包含变量之间的复杂关联。在潜在阶级分析的框架下,我们展示了DPMCPM可以通过创建额外的类别来模拟一般丢失的机制,以表示缺失,这隐式集成了缺失部分的真实条件分布。通过仿真研究和实际应用,我们证明DPMCPM优于各种缺失机制的不完全分类数据的统计推理和估算的现有方法。 DPMCPM实现为R包MMDAI,可从HTTPS://cran.r-project.org/ Web / Packages / MMDAI / index.html中获得的全综合R存档网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号