首页> 外文会议>European conference on machine learning and knowledge discovery in databases >Difference-Based Estimates for Generalization-Aware Subgroup Discovery
【24h】

Difference-Based Estimates for Generalization-Aware Subgroup Discovery

机译:基于差异的泛化感知子组发现估计

获取原文

摘要

For the task of subgroup discovery, generalization-aware interesting measures that are based not only on the statistics of the patterns itself, but also on the statistics of their generalizations have recently been shown to be essential. A key technique to increase runtime performance of subgroup discovery algorithms is the application of optimistic estimates to limit the search space size. These are upper bounds for the interestingness that any specialization of the currently evaluated pattern may have. Until now these estimates are based on the anti-monotonicity of instances, which are covered by the current pattern. This neglects important properties of generalizations. Therefore, we present in this paper a new scheme of deriving optimistic estimates for generalization aware subgroup discovery, which is based on the instances by which patterns differ in comparison to their generalizations. We show, how this technique can be applied for the most popular interestingness measures for binary as well as for numeric target concepts. The novel bounds are incorporated in an efficient algorithm, which outperforms previous methods by up to an order of magnitude.
机译:对于子组发现的任务,最近已证明,不仅基于模式本身的统计信息,而且基于其概括性的统计信息,具有泛化意识的有趣措施非常重要。提高子组发现算法的运行时性能的一项关键技术是应用乐观估计来限制搜索空间的大小。这些是当前评估模式的任何专业化可能具有的趣味性的上限。到目前为止,这些估计是基于实例的反单调性进行的,该实例已被当前模式所涵盖。这忽略了概括的重要属性。因此,我们在本文中提出了一种新的方案,该方案可为泛化感知子组发现推导乐观估计,该方案基于与泛化相比模式有所不同的实例。我们展示了如何将该技术应用于二进制以及数字目标概念的最流行的兴趣度度量。新颖的边界被合并到一个有效的算法中,该算法的性能要比以前的方法高出一个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号