An efficient approach based on selective partitioning for maximal frequent itemsets mining

Bai Anita; Dhabu Meera; Jagtap Viraj; Deshpande Parag S.

Abstract

We present SelPMiner, a maximal frequent itemset (MFI) mining algorithm based on selective partitioning. It uses a novel data format, the Itemset-count tree, a compact and optimized partition representation that reduces memory requirements. It also partitions the database selectively, which reduces the time spent scanning the database. As the algorithm searches depth-first for progressively longer frequent itemsets, it creates new, ever smaller partitions with fewer dimensions and only unique data instances, which speeds up support counting. SelPMiner applies a number of optimizations to prune the search space. We also prove upper bounds on the memory consumed by these partitions. Experimental comparisons of SelPMiner with the fastest popular existing MFI mining algorithms on different types of datasets show significant speedups in computation time in many cases. SelPMiner works especially well when the minimum support is low, and it consumes less memory.
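To make the ideas in the abstract concrete, the following is a minimal Python sketch of depth-first MFI mining over projected partitions. It is not SelPMiner and does not implement the Itemset-count tree or the paper's pruning optimizations. It only illustrates two points the abstract mentions: each deeper partition keeps fewer items (dimensions), and identical rows are merged into one entry with a count, so support is summed over unique rows. The names `project` and `mine_maximal`, and the toy transactions, are assumptions made for illustration.

```python
from collections import Counter


def project(partition, item, later_items):
    """Build a smaller partition: keep rows that contain `item`,
    restrict them to `later_items`, and merge identical rows by count."""
    projected = Counter()
    for row, count in partition.items():
        if item in row:
            projected[row & later_items] += count
    return projected


def mine_maximal(transactions, min_support):
    """Return the maximal frequent itemsets as a set of frozensets.

    Illustrative sketch only (hypothetical helper, not the paper's method).
    """
    # Root partition: unique transactions with their multiplicities.
    root = Counter(frozenset(t) for t in transactions)
    items = sorted({i for t in transactions for i in t})
    maximal = []

    def is_subsumed(itemset):
        return any(itemset <= m for m in maximal)

    def dfs(prefix, partition, candidates):
        # Support of each item inside the current partition.
        support = Counter()
        for row, count in partition.items():
            for i in row:
                support[i] += count
        frequent = [i for i in candidates if support[i] >= min_support]
        if not frequent:
            # No frequent extension: prefix is maximal unless a superset
            # was already found earlier in the fixed item order.
            if prefix and not is_subsumed(prefix):
                maximal.append(prefix)
            return
        for idx, item in enumerate(frequent):
            later = frequent[idx + 1:]
            child = project(partition, item, frozenset(later))
            dfs(prefix | {item}, child, later)

    dfs(frozenset(), root, items)
    return set(maximal)


# Toy database (assumed for illustration), minimum support count = 2.
transactions = [
    {"a", "b", "c"},
    {"a", "b", "c"},
    {"a", "b", "d"},
    {"b", "d"},
    {"c", "d"},
]
result = mine_maximal(transactions, 2)
print(sorted(sorted(m) for m in result))  # [['a', 'b', 'c'], ['b', 'd']]
```

In this toy database, {a, b, c} and {b, d} are frequent and have no frequent superset, so they are the maximal frequent itemsets. The sketch visits every frequent prefix, so it shows the partitioning idea rather than the search-space pruning the abstract describes.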

Bibliographic details

Source
Sadhana: Academy Proceedings in Engineering Science | 2019, Issue 8 | 22 pages
Authors
Bai Anita; Dhabu Meera; Jagtap Viraj; Deshpande Parag S.
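Author affiliations

Visvesvaraya National Institute of Technology, Department of Computer Science & Engineering, Nagpur 440010, Maharashtra, India;

Visvesvaraya National Institute of Technology, Department of Computer Science & Engineering, Nagpur 440010, Maharashtra, India;

Amazon, Hyderabad, India;

Visvesvaraya National Institute of Technology, Department of Computer Science & Engineering, Nagpur 440010, Maharashtra, India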

Original format: PDF
Language: English
Chinese Library Classification: Industrial technology
Keywords
Data mining; itemset-count tree; maximal frequent itemsets; partitions; transactional databases
