首页> 外文会议>International symposium on intelligent data analysis >Condensed Representations in Presence of Missing Values
【24h】

Condensed Representations in Presence of Missing Values

机译:存在缺失值的浓缩表示

获取原文

摘要

Missing values are an old problem that is very common in real data bases. We describe the damages caused by missing values on condensed representations of patterns extracted from large data bases. This is important because condensed representations are very useful to increase the efficiency of the extraction and enable new uses of frequent patterns (e.g. rules with minimal body, clustering, classification). We show that, unfortunately, such condensed representations are unreliable in presence of missing values. We present a method of treatment of missing values for condensed representations based on δ-free or closed patterns, which are the most common condensed representations. This method provides an adequate condensed representation of these patterns. We show the soundness of our approach, both on a formal point of view and experimentally. Experiments are performed with our prototype MV_(MINER) (for Missing Values miner), which computes the collection of appropriate δ-free patterns.
机译:缺失的值是真实数据基础中非常常见的旧问题。我们描述了由大数据库提取的模式的凝结表示缺失的损害损失。这很重要,因为浓缩的表示是非常有用,可以提高提取的效率,并实现频繁模式的新用途(例如,具有最小的身体,聚类,分类)。我们表明,遗憾的是,在存在缺失的价值观时,这种凝聚态的表示是不可靠的。我们提出了一种基于Δ的无Δ或封闭式图案的凝结表示的缺失值的方法,这是最常见的凝聚表示。该方法提供了这些模式的足够浓缩表示。我们展示了我们的方法的健全性,无论是在正式的角度和实验上。使用我们的原型MV_(矿工)(对于缺失值矿器)进行实验,其计算适当的Δ的无模式的集合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号