首页> 外文会议>Annual Hawaii International Conference on System Sciences >Ad-Hoc Association-Rule Mining within the Data Warehouse
【24h】

Ad-Hoc Association-Rule Mining within the Data Warehouse

机译:数据仓库内的ad-hoc协会规则挖掘

获取原文

摘要

Many organizations often underutilize their already constructed data warehouses. In this paper, we suggest a novel way of acquiring more information from corporate data warehouses without the complications and drawbacks of deploying additional software systems. Association-rule mining, which captures co-occurrence patterns within data, has attracted considerable efforts from data warehousing researchers and practitioners alike. Unfortunately, most data mining tools are loosely coupled, at best, with the data warehouse repository. Furthermore, these tools can often find association rules only within the main fact table of the data warehouse (thus ignoring the information-rich dimensions of the star schema) and are not easily applied on non-transaction level data often found in data warehouses. In this paper, we present a new data-mining framework that is tightly integrated with the data warehousing technology. Our framework has several advantages over the use of separate data mining tools. First, the data stays at the data warehouse, and thus the management of security and privacy issues is greatly reduced. Second, we utilize the query processing power of a data warehouse itself, without using a separate data-mining tool. In addition, this framework allows ad-hoc data mining queries over the whole data warehouse, not just over a transformed portion of the data that is required when a standard data-mining tool is used. Finally, this framework also expands the domain of association-rule mining from transaction-level data to aggregated data as well.
机译:许多组织经常不利于他们已经构建的数据仓库。在本文中,我们建议一种新的方式来从公司数据仓库获取更多信息,而无需部署其他软件系统的并发症和缺点。捕获数据中的共同发生模式的协会规则挖掘引起了数据仓储研究人员和从业者的相当努力。遗憾的是,大多数数据挖掘工具都是松散的耦合,最多是数据仓库存储库。此外,这些工具通常只能在数据仓库的主要事实表中找到关联规则(因此忽略了星形模式的信息丰富的维度),并且不容易应用于数据仓库中经常发现的非交易级别数据。在本文中,我们提出了一种新的数据挖掘框架,与数据仓储技术紧密集成。我们的框架与使用单独的数据挖掘工具有几个优势。首先,数据留在数据仓库,因此大大减少了安全性和隐私问题的管理。其次,我们利用数据仓库本身的查询处理能力,而无需使用单独的数据挖掘工具。此外,此框架允许在整个数据仓库中挖掘ad-hoc数据挖掘,而不仅仅是在使用标准数据挖掘工具时所需的数据的转换部分。最后,此框架还将关联规则挖掘的域从事务级数据扩展到聚合数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号