【24h】

Pruning Relations for Substructure Discovery of Multi-relational Databases

机译:修剪关系的多关系数据库的子结构发现

获取原文
获取原文并翻译 | 示例

摘要

Multirelational data mining methods discover patterns across multiple interlinked tables (relations) in a relational database. In many large organizations, such a multi-relational database spans numerous departments and/or subdivisions, which are involved in different aspects of the enterprise such as customer profiling, fraud detection, inventory management, financial management, and so on. When considering multirelational classification, it follows that these subdivisions will express different interests in the data, leading to the need to explore various subsets of relevant relations with high utility with respect to the target class. The paper presents a novel approach for pruning the uninteresting relations of a relational database where relations come from such different parties and spans many classification tasks. We aim to create a pruned structure and thus minimize predictive performance loss on the final classification model. Our method identifies a set of strongly uncorrelated subgraphs to use for training and discards all others. The experiments performed demonstrate that our strategy is able to significantly reduce the size of the relational schema without sacrificing predictive accuracy.
机译:多关系数据挖掘方法发现关系数据库中多个相互关联的表(关系)的模式。在许多大型组织中,这样的多关系数据库跨越多个部门和/或子部门,这些部门和/或部门涉及企业的不同方面,例如客户分析,欺诈检测,库存管理,财务管理等。在考虑多关系分类时,可以得出以下结论:这些细分将在数据中表达不同的兴趣,从而导致需要针对目标类别探索具有较高效用的相关关系的各个子集。本文提出了一种修剪关系数据库中无趣的关系的新颖方法,其中关系来自这样的不同方,并且涉及许多分类任务。我们旨在创建修剪后的结构,从而最大程度地减少最终分类模型上的预测性能损失。我们的方法确定了一组高度不相关的子图用于训练,并丢弃所有其他子图。进行的实验表明,我们的策略能够在不牺牲预测准确性的情况下显着减小关系模式的大小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号