首页> 外文会议>International Conference on Data Warehousing and Knowledge Discovery >Partitioned Approach to Association Rule Mining over Multiple Databases
【24h】

Partitioned Approach to Association Rule Mining over Multiple Databases

机译:多个数据库关联规则挖掘的分区方法

获取原文

摘要

Database mining is the process of extracting interesting and previously unknown patterns and correlations from data stored in Data Base Management Systems (DBMSs). Association rule mining is the process of discovering items, which tend to occur together in transactions. If the data to be mined were stored as relations in multiple databases, instead of moving data from one database to another, a partitioned approach would be appropriate. This paper addresses the partitioned approach to association rule mining for data stored in multiple Relational DBMSs. This paper proposes an approach that is very effective for partitioned databases as compared to the main memory partitioned approach. Our approach uses SQL-based K-way join algorithm and its optimizations. A second alternative that trades accuracy for performance is also presented. Our results indicate that beyond a certain size of data sets, the accuracy is preserved in addition to improving performance. Extensive experiments have been performed and results are presented for the two partitioned approaches using IBM DB2/UDB and Oracle 8i.
机译:数据库挖掘是从存储在数据库管理系统(DBMS)中的数据中提取有趣和先前未知的模式和相关性的过程。关联规则挖掘是发现项目的过程,往往会在交易中一起发生。如果要挖掘的数据被存储为多个数据库中的关系,而不是将数据从一个数据库移动到另一个数据库,而不是将数据移动到另一个数据库,则是一个分区方法是合适的。本文讨论了在多个关系DBMS中存储的数据的关联规则挖掘的分区方法。本文提出了一种对分区数据库非常有效的方法,与主要内存分区方法相比。我们的方法使用基于SQL的K-Way加入算法及其优化。还介绍了交易性能准确性的第二种选择。我们的结果表明,除了一定规模的数据集之外,除了提高性能之外,还保留了精度。已经进行了广泛的实验,并使用IBM DB2 / UDB和Oracle 8i呈现了两个分区方法的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号