A Bounded and Adaptive Memory-Based Approach to Mine Frequent Patterns From Very Large Databases

Adnan M.; Alhajj R.

Abstract

Most existing methods for association rules mining (ARM) rely on special data structures that project the database, either totally or partially, into primary memory. Traditionally, these data structures reside in main memory and rely on the existing paging mechanism of the virtual memory manager (VMM) to handle storage when they outgrow primary memory. Typically, the VMM moves the overflowing data to secondary memory based on some preassumed memory usage criteria. However, this direct and unplanned use of virtual memory results in unpredictable behavior or thrashing, as reported by some of the works described in the literature. This paper tackles the problem by presenting an ARM model capable of mining a transactional database regardless of its size and without relying on the underlying VMM; the proposed approach uses only a bounded portion of primary memory, which leaves the rest of main memory free to be assigned to other tasks with different priorities. In other words, we propose a specialized memory management system that caters to the needs of the ARM model: the proposed data structure is first constructed in the allocated primary memory. If at any point the structure grows beyond the allocated memory quota, part of it is forced out to secondary memory. The secondary-memory version of the structure is accessed on a block-by-block basis so that both the spatial and temporal localities of the I/O access are optimized. Thus, the proposed framework takes control of virtual memory access and manages the required virtual memory to the best benefit of the mining process being served. Several clever data structures are used to facilitate these optimizations. Our method has the additional advantage that other tasks of different priorities may run concurrently with the main mining task with as little interference as possible, because we do not rely on the default paging mechanism of the VMM. The reported test results demonstrate the applicability and effectiveness of the proposed approach.
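
The general idea of a bounded memory quota with block-by-block spilling can be illustrated with a minimal sketch. This is not the data structure or the memory manager from the paper, whose details the abstract does not give. The class name `BoundedBlockStore`, the `max_blocks` quota, the least-recently-used eviction policy, and the use of pickled files as "secondary memory" are all assumptions made for illustration only.

```python
import os
import pickle
import tempfile
from collections import OrderedDict


class BoundedBlockStore:
    """Hypothetical store: at most `max_blocks` blocks stay in memory.

    Blocks beyond the quota are written to disk one block at a time and
    read back on demand. Eviction is least-recently-used (an assumption,
    not a policy taken from the paper).
    """

    def __init__(self, max_blocks, directory=None):
        if max_blocks < 1:
            raise ValueError("max_blocks must be at least 1")
        self.max_blocks = max_blocks
        self._dir = directory or tempfile.mkdtemp(prefix="blocks_")
        os.makedirs(self._dir, exist_ok=True)
        self._resident = OrderedDict()  # block_id -> block, oldest first
        self.spills = 0  # blocks written to secondary storage
        self.loads = 0   # blocks read back from secondary storage

    def _path(self, block_id):
        return os.path.join(self._dir, f"block_{block_id}.pkl")

    def _evict_if_needed(self):
        # Write out least-recently-used blocks until the quota is respected.
        while len(self._resident) > self.max_blocks:
            block_id, block = self._resident.popitem(last=False)
            with open(self._path(block_id), "wb") as fh:
                pickle.dump(block, fh)
            self.spills += 1

    def put(self, block_id, block):
        self._resident[block_id] = block
        self._resident.move_to_end(block_id)  # mark as most recently used
        self._evict_if_needed()

    def get(self, block_id):
        if block_id in self._resident:
            self._resident.move_to_end(block_id)
            return self._resident[block_id]
        path = self._path(block_id)
        if not os.path.exists(path):
            raise KeyError(block_id)
        with open(path, "rb") as fh:  # one whole block per read
            block = pickle.load(fh)
        self.loads += 1
        self.put(block_id, block)  # may evict another block
        return block

    def resident_ids(self):
        return list(self._resident)
```

With `max_blocks=2`, inserting four blocks keeps only the two most recent ones in memory; fetching an evicted block reloads it from disk and pushes out the least recently used resident block. Because the application decides what is spilled and when, the memory footprint stays within the quota without depending on the operating system's paging.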

Bibliographic details

Source: Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on, 2011, Issue 1, pp. 154-172 (19 pages)
Authors: Adnan M.; Alhajj R.
Affiliation: Department of Computer Science, University of Calgary, Calgary, Canada
Original format: PDF
Language: English
Keywords: Association rules mining (ARM); FP-growth; FP-tree; frequent pattern mining; frequent patterns; index structures; secondary storage; virtual memory management (VMM)

Similar literature

1. A New Approach to Mine Frequent Pattern in Spatial Database using TFP-Tree [J]. Jitendra Agrawal, Kedar Nath Singh, Sanjeev Sharma. International Journal of Computer Technology and Applications. 2011, Issue 04.
2. Hmine-rev: Toward H-mine Parallelization on Mining Frequent Patterns in Large Databases [J]. Bowo PRASETYO, Iko PRAMUDIONO, Masaru KITSUREGAWA. 電子情報通信学会技術研究報告. デ-タ工学. Data Engineering. 2005, Issue 172.
3. An Efficient Approach to Mine Periodic-Frequent Patterns in Transactional Databases [C]. Akshat Surana, R. Uday Kiran, P. Krishna Reddy. New frontiers in applied data mining. 2011.
4. Frequent Itemset Hiding Algorithm Using Frequent Pattern Tree Approach [D]. Alnatsheh, Rami. 2012.
5. An Efficient Approach to Mining Maximal Contiguous Frequent Patterns from Large DNA Sequence Databases [O]. Md. Rezaul Karim, Md. Mamunur Rashid, Byeong-Soo Jeong. 2012.
6. A New Approach to Mine Frequent Pattern in Spatial Database using TFP-Tree [O]. Kedar Nath Singh, Jitendra Agrawal, Sanjeev Sharma. 2011.
7. Crime Pattern Analysis: A Spatial Frequent Pattern Mining Approach [R]. D. Oliver, P. Mohan, S. Shekhar, X. Zhou. 2012.
