A Chernoff Bound-Based Method for Mining Approximate Frequent Itemsets over Data Streams

Li Haifeng (李海峰); Zhang Ning (章宁)

Abstract

Data streams are fast, unbounded, and dynamic, so frequent itemsets must be mined from them as quickly as possible within limited memory. This paper partitions the stream into segments (batches), stores them in lists of 2-tuples, and proposes AFIoDS, an approximate frequent itemset mining method based on the sliding window model. AFIoDS obtains, in real time, a proper subset of the set of frequent itemsets, that is, a false-negative result. A probabilistic parameter is introduced, and the Chernoff bound is used to adjust the approximate support dynamically, which guarantees that the frequent itemsets in this subset stay within a controllable error range. To save further memory, AFIoDS compresses the frequent itemsets obtained from each segment into closed itemsets. Experiments on 3 real-world datasets show that, compared with existing algorithms, AFIoDS runs faster and needs much less storage with no loss of precision.
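The abstract does not state the formula AFIoDS uses, so the following is only a minimal sketch of how a Chernoff bound can relax a minimum support threshold. It assumes the lower-tail multiplicative Chernoff bound, Pr[x̄ ≤ s − ε] ≤ exp(−nε² / (2s)), solved for ε at failure probability δ. The function names `chernoff_epsilon` and `relaxed_threshold`, and the symbols `s`, `delta`, and `n`, are illustrative and do not come from the paper.

```python
import math


def chernoff_epsilon(min_support: float, delta: float, n: int) -> float:
    """Return eps such that exp(-n * eps**2 / (2 * min_support)) == delta.

    Assumption (not from the paper): lower-tail multiplicative Chernoff
    bound, with the true support taken to be exactly min_support.
    """
    if not (0.0 < min_support <= 1.0):
        raise ValueError("min_support must be in (0, 1]")
    if not (0.0 < delta < 1.0):
        raise ValueError("delta must be in (0, 1)")
    if n <= 0:
        raise ValueError("n must be a positive number of transactions")
    return math.sqrt(2.0 * min_support * math.log(1.0 / delta) / n)


def relaxed_threshold(min_support: float, delta: float, n: int) -> float:
    """Relaxed support threshold s - eps, clamped at 0.

    An itemset whose observed support over n transactions falls below
    this value would be discarded in this sketch.
    """
    return max(0.0, min_support - chernoff_epsilon(min_support, delta, n))


if __name__ == "__main__":
    # The relaxation shrinks as more transactions are observed.
    for n in (1_000, 10_000, 100_000):
        print(n, relaxed_threshold(0.1, 0.05, n))
```

Because ε shrinks in proportion to 1/√n, the relaxed threshold approaches the user-specified minimum support as the window accumulates transactions, which is one way the approximation can change dynamically under these assumptions.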

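Bibliographic details

Source: 《计算机科学》 (Computer Science), 2011, No. 5, pp. 164-168 (5 pages)
Authors: Li Haifeng (李海峰); Zhang Ning (章宁)
Affiliation: School of Information, Central University of Finance and Economics, Beijing 100081 (listed for each author)
Original format: PDF
Language of text: Chinese
CLC classification: Programming languages, algorithmic languages
Keywords: Chernoff Bound; data stream; frequent itemsets
Indexed: 2022-08-18 04:38:08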

