Itemset Mining on Indexed Data Blocks

Abstract

This paper presents a novel index, called I-Forest, to support data mining activities on evolving databases, whose content is periodically updated through the insertion (or deletion) of data blocks. I-Forest allows the extraction of itemsets from transactional databases, such as transactional data from large retail chains. Item, support and time constraints may be enforced during the extraction phase. The proposed index is a covering index that represents transactional blocks in a succinct form and allows different kinds of analysis (e.g., analyzing quarterly data). No support constraint is enforced during the creation phase, so the index provides a complete representation of the evolving data. The I-Forest index has been implemented in the PostgreSQL open source DBMS and exploits its physical-level access methods. Experiments have been run for both sparse and dense data distributions. The execution time of the frequent itemset extraction task exploiting the index is always comparable with, and for low support thresholds faster than, that of the Prefix-Tree algorithm accessing static data on a flat file.
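To make the three kinds of constraint in the abstract concrete, the following is a minimal sketch of constrained itemset extraction over timestamped data blocks. It does not reproduce the I-Forest structure or its PostgreSQL implementation, which the abstract does not describe in detail; it simply enumerates itemsets by brute force, which is exponential in transaction length and only suitable for toy data. The names `DataBlock`, `loaded_on` and `extract_itemsets`, and the sample retail transactions, are assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date
from itertools import combinations


@dataclass(frozen=True)
class DataBlock:
    """A batch of transactions inserted together (hypothetical layout)."""
    loaded_on: date
    transactions: tuple  # each transaction is a frozenset of items


def extract_itemsets(blocks, min_support, start, end, required_items=()):
    """Return {itemset: support count} under time, item and support constraints."""
    required = frozenset(required_items)
    counts = Counter()
    for block in blocks:
        # Time constraint: only blocks loaded within [start, end] take part.
        if not (start <= block.loaded_on <= end):
            continue
        for transaction in block.transactions:
            # Item constraint: an itemset that contains every required item
            # can only occur in a transaction that contains all of them.
            if not required <= transaction:
                continue
            optional = sorted(transaction - required)
            for size in range(len(optional) + 1):
                for extra in combinations(optional, size):
                    itemset = required | frozenset(extra)
                    if itemset:  # skip the empty itemset
                        counts[itemset] += 1
    # Support constraint: applied at extraction time only, so the stored
    # blocks remain a complete representation of the data.
    return {s: c for s, c in counts.items() if c >= min_support}


# Hypothetical retail data: three blocks, one of them outside the first quarter.
blocks = [
    DataBlock(date(2006, 1, 15), (frozenset({"bread", "milk"}),
                                  frozenset({"bread", "milk", "eggs"}))),
    DataBlock(date(2006, 2, 10), (frozenset({"bread", "eggs"}),)),
    DataBlock(date(2006, 5, 3), (frozenset({"bread", "milk"}),)),
]

# Quarterly analysis: itemsets occurring at least twice in Q1 2006.
q1 = extract_itemsets(blocks, 2, date(2006, 1, 1), date(2006, 3, 31))

# Same quarter, restricted to itemsets that contain "milk".
q1_milk = extract_itemsets(blocks, 2, date(2006, 1, 1), date(2006, 3, 31),
                           required_items={"milk"})
```

Because the support threshold is applied only when a query is answered, the same stored blocks can serve queries with different thresholds and time windows. This mirrors the abstract's point that no support constraint is enforced when the index is created.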

Bibliographic record

Source: International IEEE Conference Intelligent Systems, 2006, 6 pages
Authors: Elena Baralis; Tania Cerquitelli; Silvia Chiusano
Original format: PDF
Chinese Library Classification: TP18-53
Keywords: Algorithms; Performance; Relational DBMS; Itemset Extraction

Similar literature

1. Constrained Itemset Mining on a Sequence of Incoming Data Blocks [J]. Elena Baralis, Tania Cerquitelli, Silvia Chiusano. International Journal of Intelligent Systems. 2010, no. 5
2. An efficient projection-based indexing approach for mining high utility itemsets [J]. Guo-Cheng Lan, Tzung-Pei Hong, Vincent S. Tseng. Knowledge and Information Systems. 2014, no. 1
3. Efficient Subset-Lattice Algorithms for Mining Closed Frequent Itemsets and Maximal Frequent Itemsets in Data Streams [J]. Ye-In Chang, Chia-En Li, Wei-Hau Peng. International Journal of Electrical Engineering: Transactions of the Chinese Institute of Engineers, Series E. 2013, no. 2
4. Itemset Mining on Indexed Data Blocks [C]. Elena Baralis, Tania Cerquitelli, Silvia Chiusano. International IEEE Conference Intelligent Systems. 2006
5. Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining [D]. Hao, Boyu. 2010
6. Gene Expression Data Analysis Using Closed Itemset Mining for Labeled Data [O]. Ana Rotter, Petra Kralj Novak, Špela Baebler
7. Blocking anonymity threats raised by frequent itemset mining [O]. Maurizio Atzori, Francesco Bonchi. 2005
