Cluster Based Partition Approach for Mining Frequent Itemsets

Akhilesh Tiwari; Rajendra K. Gupta; Dev Prakash Agrawal

首页> 外文期刊>International journal of computer science and network security >Cluster Based Partition Approach for Mining Frequent Itemsets

【24h】

Cluster Based Partition Approach for Mining Frequent Itemsets

机译：基于聚类的频繁项集划分方法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Data Mining is the process of extracting interesting and previously unknown patterns and correlations form huge data stored in databases. Association rule mining- a descriptive mining technique of data mining, is the process of discovering items or literals which tend to occur together in transactions. As the data to be mined is large, the time taken for accessing data is considerable. This paper describes a new approach for association mining, based on Master-Slave architecture. It uses hybrid approach - a combination of bottom up and top down approaches for searching frequent itemsets. The Apriori algorithm performs well only when the frequent itemsets are short. Algorithms with top down approach are suitable for long frequent itemsets. This new master slave architecture based algorithm combines both bottom-up and top-down approach. The Prime number based representation consumes less memory as each transaction is replaced with the product of the assigned prime numbers of their items. It reduces the time taken to determine the support count of the itemsets. The Prime number based representation offers the flexibility for testing the validity of metarules and provides reduction in the data complexity.

机译：数据挖掘是从存储在数据库中的巨大数据中提取出有趣的，以前未知的模式和相关性的过程。关联规则挖掘-一种数据挖掘的描述性挖掘技术，是发现倾向于在交易中同时出现的项目或文字的过程。由于要挖掘的数据很大，因此访问数据所花费的时间相当可观。本文介绍了一种基于Master-Slave体系结构的关联挖掘新方法。它使用混合方法-自下而上和自上而下方法的组合来搜索频繁的项目集。仅当频繁项集较短时，Apriori算法才能发挥出色的性能。自顶向下方法的算法适用于长期频繁的项目集。这种基于主从架构的新算法结合了自下而上和自上而下的方法。基于质数的表示形式消耗的内存更少，因为每个交易都被为其项目分配的质数的乘积所代替。它减少了确定项目集支持计数所需的时间。基于素数的表示形式为测试元规则的有效性提供了灵活性，并降低了数据复杂性。

著录项

来源
《International journal of computer science and network security》 |2009年第6期|191-199|共9页
作者
Akhilesh Tiwari; Rajendra K. Gupta; Dev Prakash Agrawal;
展开▼
作者单位

Madhav Institute of Technology & Science, Gwalior, India;

Madhav Institute of Technology & Science, Gwalior, India;

Union Public Service Commission, New Delhi, India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
frequent patterns; candidate distribution; hybrid approach; KDD;

机译：频繁的模式;候选人分布;混合方法凯迪;

相似文献

外文文献
中文文献
专利

1. Cluster Based Partition Approach for Mining Frequent Itemsets [J] . Akhilesh Tiwari, Rajendra K. Gupta, Dev Prakash Agrawal International journal of computer science and network security . 2009,第6期

机译：基于聚类的频繁项集划分方法
2. An efficient approach based on selective partitioning for maximal frequent itemsets mining [J] . ANITA BAI, MEERA DHABU, VIRAJ JAGTAP, Sadhana . 2019,第8期

机译：基于选择性分区的最大频繁项集挖掘有效方法
3. An efficient approach based on selective partitioning for maximal frequent itemsets mining [J] . Bai Anita, Dhabu Meera, Jagtap Viraj, Sadhana: Academy Proceedings in Engineering Science . 2019,第8期

机译：基于最大频繁项目集采矿的选择性分区的一种有效方法
4. Improvements in the Data Partitioning Approach for Frequent Itemsets Mining [C] . Son N. Nguyen, Maria E. Orlowska . 2005

机译：频繁项目集挖掘的数据分区方法的改进
5. Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining. [D] . Hao, Boyu. 2010

机译：从不确定的数据中挖掘频繁项集：约束挖掘和流挖掘的扩展。
6. Bit-Table Based Biclustering and Frequent Closed Itemset Mining in High-Dimensional Binary Data [O] . András Király, Attila Gyenesei, János Abonyi -1

机译：高位二进制数据中基于位表的聚类和频繁封闭项集挖掘
7. Improvements in the Data Partitioning Approach for Frequent Itemsets Mining [O] . Son N. Nguyen, Maria E. Orlowska 2005

机译：频繁项目集挖掘数据分区方法的改进

Cluster Based Partition Approach for Mining Frequent Itemsets

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅