A new method of multiple imputation for completely (or almost completely) missing data

机译：一种完全（或几乎完全）丢失数据的多重插补新方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

One of the important questions the researcher must answer assessing data quality while preparing information for a data mining procedure is whether missing observations in the dataset are missing at random, and whether some form of imputation is needed. If all (or almost all) observations of a variable are missing, they cannot be classified as missing at random. Therefore, most known methods of imputation of missing values cannot be applied to this variable. This paper studies a particular way for creating imputations in datasets containing completely (or almost completely) missing variables. As it is shown in the paper, if no external data are available, the maximum entropy distribution is the only reasonable probability distribution for producing proper imputation in case of such variables. Two examples of real-life epidemiological studies demonstrate this approach.

机译：研究人员在为数据挖掘程序准备信息时必须回答的评估数据质量的重要问题之一是，是否随机丢失了数据集中的缺失观测值，以及是否需要某种形式的插补。如果缺少变量的所有（或几乎所有）观察值，则不能将它们随机分类为缺失。因此，大多数估算缺失值的方法无法应用于此变量。本文研究了一种在包含完全（或几乎完全）缺失变量的数据集中创建插补的特殊方法。如本文所示，如果没有外部数据可用，则在这种变量的情况下，最大熵分布是产生适当推算的唯一合理概率分布。现实生活中的流行病学研究的两个例子证明了这种方法。

著录项

来源
《Advances in mathematical and computational methods》|2010年|p.34-45|共12页
会议地点 Faro(PT);Faro(PT)
作者
ARKADY BOLOTIN;
展开▼
作者单位

Epidemiology Department Ben-Gurion University of the Negev Beersheba, Israel;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
missing variables; non-random missingness; maximum entropy distributions;

机译：缺少变量；非随机失踪；最大熵分布;

相似文献

外文文献
中文文献
专利

1. Multiple imputation of completely missing repeated measures data within person from a complex sample: application to accelerometer data in the National Health and Nutrition Examination Survey [J] . Liu Benmei, Yu Mandi, Graubard Barry I., Statistics in medicine . 2016,第28期

机译：复杂样本中人体内完全缺失的重复测量数据的多重估算：在国家健康和营养检查调查中应用于加速度计数据
2. Multiple imputation of completely missing repeated measures data within person from a complex sample: application to accelerometer data in the National Health and Nutrition Examination Survey [J] . Liu Benmei, Yu Mandi, Graubard Barry I., Statistics in medicine . 2016,第28期

机译：从复杂的样本中，多重丢失的重复措施数据的重复措施数据：应用于国家健康和营养考试调查中的加速度计数据
3. Comparative Study of Four Methods in Missing Value Imputations under Missing Completely at Random Mechanism [J] . Michikazu Nakai, Ding-Geng Chen, Kunihiro Nishimura, Open Journal of Statistics . 2014,第1期

机译：随机机制完全缺失下四种缺失值估算方法的比较研究
4. AUGMENTED STOCHASTIC MULTIPLE IMPUTATION MODEL FOR AIRPORT PAVEMENT MISSING DATA IMPUTATION [C] . J. Farhan, T. F. Fwa Annual meeting of the transportation research board;Transportation Research Board . 2014

机译：用于机场铺面缺失数据插补的增强随机多插补模型
5. Evaluating Multiple Imputation Methods for Longitudinal Healthy Aging Index—A Score Variable with Data Missing Due to Death, Dropout and Several Missing Data Mechanisms [D] . Kane, Elizabeth L. 2017

机译：纵向健康老龄化指数的多种估算方法的评估-一个因死亡，辍学和几种缺失数据机制导致数据缺失的得分变量
6. Multiple Imputation of Completely Missing Repeated Measures Data within Person from a Complex Sample: Application to Accelerometer Data in the National Health and Nutrition Examination Survey [O] . Benmei Liu, Mandi Yu, Barry I Graubard, -1

机译：复杂样本中人体内完全丢失的重复测量数据的多重估算：在国家健康和营养检查调查中的加速度计数据中的应用
7. Comparative Study of Four Methods in Missing Value Imputations under Missing Completely at Random Mechanism [O] . Michikazu Nakai, Ding-Geng Chen, Kunihiro Nishimura, 2014

机译：随机机制缺失缺失缺失价值避难所的四种方法的比较研究

A new method of multiple imputation for completely (or almost completely) missing data

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅