首页> 外文会议>International Conference on Data Warehousing and Knowledge Discovery >Incremental Data Mining Using Concurrent Online Refresh of Materialized Data Mining Views
【24h】

Incremental Data Mining Using Concurrent Online Refresh of Materialized Data Mining Views

机译:使用并发在线刷新物化数据挖掘视图的增量数据挖掘

获取原文

摘要

Data mining is an iterative process. Users issue series of similar data mining queries, in each consecutive run slightly modifying either the definition of the mined dataset, or the parameters of the mining algorithm. This model of processing is most suitable for incremental mining algorithms that reuse the results of previous queries when answering a given query. Incremental mining algorithms require the results of previous queries to be available. One way to preserve those results is to use materialized data mining views. Materialized data mining views store the mined patterns and refresh them as the underlying data change. Data mining and knowledge discovery often take place in a data warehouse environment. There can be many relatively small materialized data mining views defined over the data warehouse. Separate refresh of each materialized view can be expensive, if the refresh process has to re-discover patterns in the original database. In this paper we present a novel approach to materialized data mining view refresh process. We show that the concurrent on-line refresh of a set of materialized data mining views is more efficient than the sequential refresh of individual views. We present the framework for the integration of data warehouse refresh process with the maintenance of materialized data mining views. Finally, we prove the feasibility of our approach by conducting several experiments on synthetic data sets.
机译:数据挖掘是一个迭代过程。用户发出系列类似的数据挖掘查询,在每次连续运行中略微修改挖掘数据集的定义,或挖掘算法的参数。这种处理型号最适合增量挖掘算法,该算法在接听给定查询时重复使用先前查询的结果。增量挖掘算法需要先前查询的结果可用。保护这些结果的一种方法是使用物化数据挖掘视图。物化数据挖掘视图存储挖掘模式并将其刷新为底层数据更改。数据挖掘和知识发现通常在数据仓库环境中进行。在数据仓库中定义了许多相对较小的物化数据挖掘视图。如果刷新过程必须在原始数据库中重新发现模式,则每个物流视图的单独刷新可能是昂贵的。在本文中,我们提出了一种新的物化数据挖掘视图刷新过程方法。我们表明,一组实质化数据挖掘视图的并发在线刷新比单个视图的顺序刷新更有效。我们介绍了数据仓库刷新过程集成的框架,维护了物化数据挖掘视图。最后,我们通过对合成数据集进行多次实验来证明我们的方法的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号