【24h】

Tight Optimistic Estimates for Fast SubgroupDiscovery

机译:快速子组发现的紧乐观估计

获取原文

摘要

Subgroup discovery is the task of finding subgroups of a population which exhibit both distributional unusualness and high generality. Due to the non monotonicity of the corresponding evaluation functions, standard pruning techniques cannot be used for subgroup discovery, requiring the use of optimistic estimate techniques instead. So far, however, optimistic estimate pruning has only been considered for the extremely simple case of a binary target attribute and up to now no attempt was made to move beyond suboptimal heuristic optimistic estimates. In this paper, we show that optimistic estimate pruning can be developed into a sound and highly effective pruning approach for subgroup discovery. Based on a precise definition of optimality we show that previous estimates have been tight only in special cases. Thereafter, we present tight optimistic estimates for the most popular binary and multi-class quality functions, and present a family of increasingly efficient approximations to these optimal functions. As we show in empirical experiments, the use of our newly proposed optimistic estimates can lead to a speed up of an order of magnitude compared to previous approaches.
机译:子组发现是寻找具有分布异常性和高度普遍性的总体子组的任务。由于相应评估功能的非单调性,无法将标准修剪技术用于子组发现,而需要使用乐观估计技术。但是,到目前为止,仅对二进制目标属性的极其简单的情况考虑了乐观估计修剪,并且到目前为止,还没有尝试超越次优启发式乐观估计。在本文中,我们表明,乐观估计修剪可以被发展为用于子组发现的可靠且高效的修剪方法。根据最优性的精确定义,我们表明以前的估计仅在特殊情况下才严格。此后,我们对最流行的二进制和多类质量函数给出了严格的乐观估计,并给出了对这些最优函数越来越有效的近似值。正如我们在经验实验中所显示的,与以前的方法相比,使用我们新近提出的乐观估计可以导致数量级的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号