On the Efficient Representation of Datasets as Graphs to Mine Maximal Frequent Itemsets

Halim Zahid; Ali Omer; Khan Muhammad Ghufran

首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >On the Efficient Representation of Datasets as Graphs to Mine Maximal Frequent Itemsets

【24h】

On the Efficient Representation of Datasets as Graphs to Mine Maximal Frequent Itemsets

机译：关于数据集的高效表示为挖掘最大频繁项集的图表

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Frequent itemsets mining is an active research problem in the domain of data mining and knowledge discovery. With the advances in database technology and an exponential increase in data to be stored, there is a need for efficient approaches that can quickly extract useful information from such large datasets. Frequent Itemsets (FIs) mining is a data mining task to find itemsets in a transactional database which occur together above a certain frequency. Finding these FIs usually requires multiple passes over the databases; therefore, making efficient algorithms crucial for mining FIs. This work presents a graph-based approach for representing a complete transactional database. The proposed graph-based representation enables the storing of all relevant information (for extracting FIs) of the database in one pass. Later, an algorithm that extracts the FIs from the graph-based structure is presented. Experimental results are reported comparing the proposed approach with 17 related FIs mining methods using six benchmark datasets. Results show that the proposed approach performs better than others in terms of time.

机译：频繁的项目挖掘是数据挖掘和知识发现领域的积极研究问题。随着数据库技术的进步和要存储的数据的指数增加，需要有效的方法，可以快速从这些大型数据集中提取有用信息。频繁的项目集（FIS）挖掘是一种数据挖掘任务，可以在一定频率上一起出现的事务数据库中找到项目集。找到这些FIS通常需要多次通过数据库;因此，高效的算法对于采矿FIS至关重要。这项工作提出了一种基于图形的方法，用于表示完整的事务数据库。所提出的基于图形的表示，可以在一次通过中存储数据库的所有相关信息（用于提取FIS）。稍后，提出了一种从基于图形的结构中提取FIS的算法。报告了使用六个基准数据集比较了使用六个基准数据集的17个相关的FIS挖掘方法的提出方法。结果表明，在时间方面，该方法的表现比其他方法更好。

著录项

来源
《IEEE Transactions on Knowledge and Data Engineering》 |2021年第4期|1674-1691|共18页
作者
Halim Zahid; Ali Omer; Khan Muhammad Ghufran;
展开▼
作者单位

Ghulam Ishaq Khan Inst Engn Sci & Technol Machine Intelligence Res Grp MInG Fac Comp Sci & Engn Topi 23460 Pakistan;

Ghulam Ishaq Khan Inst Engn Sci & Technol Machine Intelligence Res Grp MInG Fac Comp Sci & Engn Topi 23460 Pakistan;

Ghulam Ishaq Khan Inst Engn Sci & Technol Machine Intelligence Res Grp MInG Fac Comp Sci & Engn Topi 23460 Pakistan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Itemsets; Data mining; Databases; Data structures; Task analysis; Benchmark testing; Machine intelligence; Efficient frequent itemsets extraction; efficient data structure; graph utility; maximal frequent itemsets;

机译：项目集;数据挖掘;数据结构;任务分析;基准测试;机器智能;高效频繁的项目集提取;高效数据结构;图形实用性;最大频繁的项目集;

相似文献

外文文献
中文文献
专利

1. EFFICIENT SUBSET-LATTICE ALGORITHMS FOR MINING CLOSED FREQUENT ITEMSETS AND MAXIMAL FREQUENT ITEMSETS IN DATA STREAMS [J] . Ye-In Chang, Chia-En Li, Wei-Hau Peng, International Journal of Electrical Engineering: Transactions of the Chinese Institute of Engineers, Series E . 2013,第2期

机译：高效的子格算法，用于挖掘数据流中的封闭频率项和最大频率项
2. AT-Mine: An Efficient Algorithm of Frequent Itemset Mining on Uncertain Dataset [J] . Le Wang, Lin Feng, Mingfei Wu Journal of Computers . 2013,第6期

机译：at-mine：在不确定数据集中的频繁替代项目集的高效算法
3. An efficient parallel row enumerated algorithm for mining frequent colossal closed itemsets from high dimensional datasets [J] . Vanahalli Manjunath K., Patil Nagamma Information Sciences: An International Journal . 2019,第期

机译：一种有效的并行行枚举算法，用于从高维数据集频繁频繁的巨大闭合项集
4. Efficiently Mining Maximal Frequent Itemsets Based on Digraph [C] . Zhibo Ren, Qiang Zhang, Xiujuan Ma International Conference on Fuzzy Systems and Knowledge Discovery . 2007

机译：基于数字化的高效挖掘最大频繁项目集
5. Efficiently mining frequent itemsets from very large databases. [D] . Zhu, Jianfei. 2004

机译：从大型数据库中有效地挖掘频繁的项目集。
6. Utilizing maximal frequent itemsets and social network analysis for HIV data analysis [O] . Yunuscan Koçak, Tansel Özyer, Reda Alhajj 2016

机译：利用最大频繁项集和社交网络分析进行HIV数据分析
7. Exploiting the Duality of Maximal Frequent Itemsets and Minimal Infrequent Itemsets for I/O Efficient Association Rule Mining [O] . K. K. Loo, Chi-lap Yip, Ben Kao, 2000

机译：利用最大频繁项集和最小频繁项集的对偶性进行I / O有效关联规则挖掘

On the Efficient Representation of Datasets as Graphs to Mine Maximal Frequent Itemsets

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅