首页> 外文会议>SIAM International Conference on Data Mining >Making Kernel Density Estimation Robust towards Missing Values in Highly Incomplete Multivariate Data without Imputation
【24h】

Making Kernel Density Estimation Robust towards Missing Values in Highly Incomplete Multivariate Data without Imputation

机译:使内核密度估计在不归尽的高度不完全多变量数据中稳健地稳健

获取原文

摘要

Density estimation is one of the most frequently used data analytics techniques. A major challenge of real-world datasets is missing values, originating e.g. from sampling errors or data loss. The recovery of these is often impossible or too expensive. Missing values are not necessarily limited to a few features or samples, rendering methods based on complete auxiliary variables unsuitable. In this paper we introduce three models able to deal with such datasets. They are based on the new concept of virtual objects. Additionally, we present a computationally efficient approximation. Generalizing KDE, our methods are called Warp-KDE. Experiments with incomplete datasets show that Warp-KDE methods are superior to established imputation methods.
机译:密度估计是最常用的数据分析技术之一。实际数据集的主要挑战是缺失的值,源于例如,从采样错误或数据丢失。恢复这些通常是不可能的或太贵。缺失的值不一定限于少数特征或样本,基于完全辅助变量不合适的渲染方法。在本文中,我们介绍了三种能够处理此类数据集的模型。它们基于虚拟对象的新概念。此外,我们介绍了计算有效的近似。概括KDE,我们的方法称为WARP-KDE。不完全数据集的实验表明,WARP-KDE方法优于建立的载体方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号