首页> 美国卫生研究院文献>Journal of Healthcare Engineering >Outlier Removal in Model-Based Missing Value Imputation for Medical Datasets
【2h】

Outlier Removal in Model-Based Missing Value Imputation for Medical Datasets

机译:基于模型的医疗数据集缺失值估算中的异常值去除

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Many real-world medical datasets contain some proportion of missing (attribute) values. In general, missing value imputation can be performed to solve this problem, which is to provide estimations for the missing values by a reasoning process based on the (complete) observed data. However, if the observed data contain some noisy information or outliers, the estimations of the missing values may not be reliable or may even be quite different from the real values. The aim of this paper is to examine whether a combination of instance selection from the observed data and missing value imputation offers better performance than performing missing value imputation alone. In particular, three instance selection algorithms, DROP3, GA, and IB3, and three imputation algorithms, KNNI, MLP, and SVM, are used in order to find out the best combination. The experimental results show that that performing instance selection can have a positive impact on missing value imputation over the numerical data type of medical datasets, and specific combinations of instance selection and imputation methods can improve the imputation results over the mixed data type of medical datasets. However, instance selection does not have a definitely positive impact on the imputation result for categorical medical datasets.
机译:许多现实世界的医学数据集包含一定比例的缺失(属性)值。通常,可以执行缺失值插补来解决此问题,即通过基于(完整)观测数据的推理过程来提供缺失值的估计。但是,如果观察到的数据包含一些嘈杂的信息或异常值,则丢失值的估计可能不可靠,甚至可能与实际值完全不同。本文的目的是检验从观测数据中选择实例和缺失值插补的组合是否比单独执行缺失值插补提供了更好的性能。特别是,为了找到最佳组合,使用了三种实例选择算法DROP3,GA和IB3,以及三种插补算法KNNI,MLP和SVM。实验结果表明,执行实例选择可以对医疗数据集的数值数据类型上的缺失值插补产生积极影响,实例选择和插补方法的特定组合可以改善医疗数据集混合数据类型上的插补结果。但是,实例选择对分类医学数据集的插补结果没有肯定的积极影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号