首页> 外文会议>IEEE International Conference on Data Mining Workshops >Mining Predictive Redescriptions with Trees
【24h】

Mining Predictive Redescriptions with Trees

机译:用树挖掘预测性重新描述

获取原文

摘要

In many areas of science, scientists need to find distinct common characterizations of the same objects and, vice versa, identify sets of objects that admit multiple shared descriptions. For example, a biologist might want to find a set of bioclimatic conditions and a set of species, such that this bioclimatic profile adequately characterizes the areas inhabited by these fauna. In data analysis, the task of automatically generating such alternative characterizations is called redescription mining. A number of algorithms have been proposed for mining redescriptions which usually differ on the type of redescriptions they construct. In this paper, we demonstrate the power of tree-based redescriptions and present two new algorithms for mining them. Tree-based redescriptions can have very strong predictive power (i.e. they generalize well to unseen data), but unfortunately they are not always easy to interpret. To alleviate this major drawback, we present an adapted visualization, integrated into an existing interactive mining framework.
机译:在科学的许多领域中,科学家都需要找到相同对象的独特共同特征,反之亦然,确定允许多个共享描述的一组对象。例如,生物学家可能希望找到一组生物气候条件和一组物种,以使这种生物气候特征充分表征这些动物居住的区域。在数据分析中,自动生成此类替代特征的任务称为重新定义挖掘。已经提出了许多用于挖掘重定义的算法,这些算法通常在其构造的重定义类型上有所不同。在本文中,我们演示了基于树的重新描述的功能,并提出了两种用于挖掘它们的新算法。基于树的重述可以具有非常强的预测能力(即,它们可以很好地泛化到看不见的数据),但是不幸的是,它们并不总是易于解释。为了缓解这一主要缺点,我们提出了一种经过改进的可视化工具,该可视化工具已集成到现有的交互式挖掘框架中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号