A Greedy Approach to Concurrent Processing of Frequent Itemset Queries

机译：一种贪婪地处理频繁项目集查询的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider the problem of concurrent execution of multiple frequent itemset queries. If such data mining queries operate on overlapping parts of the database, then their overall I/O cost can be reduced by integrating their dataset scans. The integration requires that data structures of many data mining queries are present in memory at the same time. If the memory size is not sufficient to hold all the data mining queries, then the queries must be scheduled into multiple phases of loading and processing. Since finding the optimal assignment of queries to phases is infeasible for large batches of queries due to the size of the search space, heuristic algorithms have to be applied. In this paper we formulate the problem of assigning the queries to phases as a particular case of hypergraph partitioning. To solve the problem, we propose and experimentally evaluate two greedy optimization algorithms.

机译：我们考虑并发执行多个频繁项目集查询的问题。如果此类数据挖掘查询在数据库的重叠部分上运行，则通过集成数据集扫描，可以减少其整体I / O成本。集成要求许多数据挖掘查询的数据结构同时存在于内存中。如果存储器大小不足以保存所有数据挖掘查询，则必须将查询调度到加载和处理的多个阶段。由于发现对阶段的查询的最佳分配是不可行的，因为由于搜索空间的大小，因此必须应用启发式算法。在本文中，我们制定将查询分配给阶段的问题，作为一个特定的超图分区的情况。为了解决问题，我们提出并通过实验评估了两个贪婪优化算法。

著录项

来源
《International Conference on Data Warehousing and Knowledge Discovery(DaWaK 2006)》|2006年||共10页
会议地点
作者
Pawel Boinski; Marek Wojciechowski; Maciej Zakrzewicz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13;
关键词

相似文献

外文文献
中文文献
专利

1. Integration of candidate hash trees in concurrent processing of frequent itemset queries using Apriori [J] . Przemyslaw Grudzinski, Marek Wojciechowski Control & Cybernetics . 2009,第1期

机译：使用Apriori在频繁项集查询的并发处理中集成候选哈希树
2. A novel process-based association rule approach through maximal frequent itemsets for big data processing [J] . Zelei Liu, Liang Hu, Chunyi Wu, Future generation computer systems . 2018,第APRa期

机译：通过最大频繁项集进行大数据处理的基于过程的新颖关联规则方法
3. Efficient privacy-preserving frequent itemset query over semantically secure encrypted cloud database [J] . Wu Wei, Xian Ming, Parampalli Udaya, World Wide Web . 2021,第2期

机译：高效保留频繁的itemset查询语义安全加密云数据库
4. A Greedy Approach to Concurrent Processing of Frequent Itemset Queries [C] . Pawel Boinski, Marek Wojciechowski, Maciej Zakrzewicz Data Warehousing and Knowledge Discovery; Lecture Notes in Computer Science; 4081 . 2006

机译：并发处理频繁项查询的贪婪方法
5. Frequent Itemset Hiding Algorithm Using Frequent Pattern Tree Approach. [D] . Alnatsheh, Rami. 2012

机译：使用频繁模式树方法的频繁项集隐藏算法。
6. DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach [O] . Akdes Serin, Martin Vingron 2011

机译：DeBi：使用频繁项集方法发现差异表达的Biclusters
7. Three Strategies for Concurrent Processing of Frequent Itemset Queries Using FP-growth* [O] . Marek Wojciechowski, Krzysztof Galecki, Krzysztof Gawronek 2008

机译：使用Fp-growth *同时处理频繁项集查询的三种策略*
8. Frequent Itemset Mining for Query Expansion in Microblog Ad-hoc Search. [R] . Aboulnaga, Y., Clarke, C. L. 2012

机译：微博ad-hoc搜索中用于查询扩展的频繁项集挖掘。

A Greedy Approach to Concurrent Processing of Frequent Itemset Queries

摘要

著录项

相似文献

相关主题

期刊订阅