首页> 外文期刊>Journal of Intelligent Information Systems >Redescription mining augmented with random forest of multi-target predictive clustering trees
【24h】

Redescription mining augmented with random forest of multi-target predictive clustering trees

机译:多目标预测聚类树的随机森林增强了重新描述挖掘

获取原文
获取原文并翻译 | 示例
       

摘要

In this work, we present a redescription mining algorithm that uses Random Forest of Predictive Clustering Trees (RFPCTs) for generating and iteratively improving a set of redescriptions. The approach uses information about element membership in different queries, generated from a single constructed PCT, to explore redescription space, while queries obtained from the Random Forest of PCTs increase candidate diversity. The approach is able to produce highly accurate, statistically significant redescriptions described by Boolean, nominal or numerical attributes. As opposed to current tree-based approaches that use multi-class or binary classification, we explore the benefits of using multi-label classification and multi-target regression to create redescriptions. Major benefit of the approach, compared to other state of the art solutions, is that it does not require specifying minimal threshold on redescription accuracy to obtain highly accurate, optimized set of redescriptions. The process of Random Forest based augmentation and different modes of redescription set creation are evaluated on three datasets with different properties. We use the same datasets to compare the performance of our algorithm to state of the art redescription mining approaches.
机译:在这项工作中,我们提出了一种重新定义挖掘算法,该算法使用预测性聚类树的随机森林(RFPCT)生成和迭代地改进了一组重新描述。该方法使用有关从单个构造的PCT生成的不同查询中元素成员资格的信息来探索重新描述空间,而从PCT随机森林中获取的查询会增加候选者的多样性。该方法能够产生由布尔,名义或数字属性描述的高度准确的,具有统计意义的重述。与当前使用多类或二进制分类的基于树的方法相反,我们探索了使用多标签分类和多目标回归创建重新描述的好处。与其他现有技术解决方案相比,该方法的主要优点在于,它不需要为重新定义精度指定最小阈值即可获得高度准确的优化重新定义集。在具有不同属性的三个数据集上评估了基于随机森林的扩充过程和不同的重新定义集创建模式。我们使用相同的数据集将算法的性能与最新的重定义挖掘方法进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号