首页> 外文会议>Annual ACM symposium on applied computing;ACM symposium on applied computing;SAC 2010 >A persistent HY-Tree to efficiently support itemset mining on large datasets

【24h】

A persistent HY-Tree to efficiently support itemset mining on large datasets

机译：持久的HY-Tree，可有效支持大型数据集上的项集挖掘

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents the HY-Tree persistent tree structure that provides a compact representation of a transactional dataset for frequent itemset mining. The HY-Tree is characterized by a hybrid structure that easily adapts to different data distributions. The data representation is complete, since no support threshold is enforced during the HY-TREE creation process. The HY-Tree can be profitably exploited by a variety of itemset mining algorithms (e.g., LCM v.2, nonordFP). It effectively supports the data retrieval step in the itemset mining process by reducing both the I/O cost and the memory requirements for data loading. Experiments on large synthetic datasets show the compactness of the HY-Tree data representation and the efficiency and scalability on large datasets of the mining algorithms supported by it.

机译：本文介绍了HY-Tree持久树结构，该结构为频繁项集挖掘提供了事务性数据集的紧凑表示。 HY-Tree的特征在于混合结构，可以轻松适应不同的数据分布。数据表示是完整的，因为在HY-TREE创建过程中没有实施支持阈值。可以通过各种项目集挖掘算法（例如LCM v.2，nonordFP）来有益地利用HY-Tree。它通过降低I / O成本和数据加载的内存要求，有效地支持了项集挖掘过程中的数据检索步骤。在大型综合数据集上进行的实验表明，HY-Tree数据表示的紧凑性及其支持的挖掘算法在大型数据集上的效率和可扩展性。

著录项

来源
《Annual ACM symposium on applied computing;ACM symposium on applied computing;SAC 2010 》|2010年|P.1060-1064|共5页
会议地点
作者
Elena Baralis; Tania Cerquitelli; Silvia Chiusano;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术 ;
关键词
itemset mining; knowledge discovery;

机译：项目集挖掘;知识发现;

相似文献

外文文献
中文文献
专利

1. An efficient dynamic switching algorithm for mining colossal closed itemsets from high dimensional datasets [J] . Vanahalli Manjunath K., Patil Nagamma Data & Knowledge Engineering . 2019 ,第Sepa期

机译：从高维数据集中挖掘巨大封闭项目集的有效动态切换算法
2. An efficient parallel row enumerated algorithm for mining frequent colossal closed itemsets from high dimensional datasets [J] . Vanahalli Manjunath K., Patil Nagamma Information Sciences: An International Journal . 2019 ,第期

机译：一种有效的并行行枚举算法，用于从高维数据集频繁频繁的巨大闭合项集
3. EIFDD: An efficient approach for erasable itemset mining of very dense datasets [J] . Giang Nguyen, Tuong Le, Bay Vo, Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2015 ,第1期

机译：EIFDD：一种非常密集的数据集可擦除项集挖掘的有效方法
4. A persistent HY-Tree to efficiently support itemset mining on large datasets [C] . Annual ACM symposium on applied computing . 2010

机译：一个持久的Hy-tree，以有效地支持在大型数据集上的项目集挖掘
5. Efficiently mining frequent itemsets from very large databases. [D] . Zhu, Jianfei. 2004

机译：从大型数据库中有效地挖掘频繁的项目集。
6. An efficient pattern growth approach for mining fault tolerant frequent itemsets [O] . Shariq Bashir -1

机译：挖掘容错频繁项集的有效模式增长方法
7. E-msNFIS: An Efficient Method for Mining Negative Frequent Itemsets based on Multiple Minimum Supports [O] . Xiangjun Dong, Tiantian Xu, Yuanyuan Xu, 2015

机译：E-MSNFIS：基于多个最小支持的挖掘负频率集合的有效方法
8. BAMBOO: Accelerating Closed Itemset Mining by Deeply Pushing the Length- Decreasing Support Constraint [R] . Wang, J. , Karypis, G. 2003

机译：BamBOO：通过深度推动长度减小的支撑约束来加速闭项集挖掘

A persistent HY-Tree to efficiently support itemset mining on large datasets

摘要

著录项

相似文献

相关主题

期刊订阅