【24h】

Redescription Mining with Multi-target Predictive Clustering Trees

机译:使用多目标预测聚类树进行重新挖掘

获取原文

摘要

Redescription mining is a field of knowledge discovery that aims to find different descriptions of subsets of elements in the data by using two or more disjoint sets of descriptive attributes. The ability to find connections between different sets of descriptive attributes and provide a more comprehensive set of rules makes it very useful in practice. In this work, we introduce redescription mining algorithm for generating and iteratively improving a redescription set of user defined size based on multi-target Predictive Clustering Trees. This approach uses information about element membership in different generated rules to search for new redescriptions and is able to produce highly accurate, statistically significant redescriptions described by Boolean, nominal or numeric attributes. As opposed to current tree-based approaches that use multi-class or binary classification, we explore benefits of using multi target classification and regression to create redescriptions. The process of iterative redescription set improvement is illustrated on the dataset describing 199 world countries and their trading patterns. The performance of the algorithm is compared against the state of the art redescription mining algorithms.
机译:重定义挖掘是知识发现的领域,其目的是通过使用两个或多个不相交的描述性属性集来找到数据中元素子集的不同描述。在不同的描述性属性集之间找到联系并提供更全面的规则的能力使其在实践中非常有用。在这项工作中,我们介绍了基于多目标预测聚类树的生成和迭代改进用户定义大小的重新定义集的重新定义挖掘算法。此方法使用有关不同生成规则中元素成员资格的信息来搜索新的重新定义,并且能够生成由布尔,名义或数字属性描述的高度准确的,具有统计意义的重新定义。与使用多类或二进制分类的当前基于树的方法相反,我们探索了使用多目标分类和回归来创建重新描述的好处。数据集描述了199个世界国家及其贸易模式,说明了迭代重新定义集改进的过程。将算法的性能与最新的重新定义挖掘算法进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号