Approximation to Expected Support of Frequent Itemsets in Mining Probabilistic Sets of Uncertain Data

Alfredo Cuzzocrea; Carson K. Leung; Richard Kyle MacKinnon

首页> 外文期刊>Procedia Computer Science >Approximation to Expected Support of Frequent Itemsets in Mining Probabilistic Sets of Uncertain Data

【24h】

Approximation to Expected Support of Frequent Itemsets in Mining Probabilistic Sets of Uncertain Data

机译：挖掘不确定数据概率集中的频繁项集的预期支持的近似值

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Knowledge discovery and data mining generally discovers implicit, previously unknown, and useful knowledge from data. As one of the popular knowledge discovery and data mining tasks, frequent itemset mining, in particular, discovers knowledge in the form of sets of frequently co-occurring items, events, or objects. On the one hand, in many real-life applications, users mine frequent patterns from traditional databases of precise data, in which users know certainly the presence of items in transactions. On the other hand, in many other real-life applications, users mine frequent itemsets from probabilistic sets of uncertain data, in which users are uncertain about the likelihood of the presence of items in transactions. Each item in these probabilistic sets of uncertain data is often associated with an existential probability expressing the likelihood of its presence in that transaction. To mine frequent itemsets from these probabilistic datasets, many existing algorithms capture lots of information to compute expected support. To reduce the amount of space required, algorithms capture some but not all information in computing or approximating expected support. The tradeoff is that the upper bounds to expected support may not be tight. In this paper, we examine several upper bounds and recommend to the user which ones consume less space while providing good approximation to expected support of frequent itemsets in mining probabilistic sets of uncertain data.

机译：知识发现和数据挖掘通常会从数据中发现隐式，先前未知和有用的知识。作为流行的知识发现和数据挖掘任务之一，频繁项集挖掘尤其以频繁出现的项目，事件或对象的集合的形式发现知识。一方面，在许多实际应用中，用户从传统的精确数据数据库中挖掘出频繁的模式，在这些数据库中，用户当然知道交易中项目的存在。另一方面，在许多其他实际应用中，用户从概率不确定的数据集中挖掘频繁的项目集，其中用户不确定交易中项目存在的可能性。这些概率不确定性数据集中的每个项目通常与一个存在概率相关联，该概率表示该事务中该项目存在的可能性。为了从这些概率数据集中挖掘频繁项集，许多现有算法捕获了大量信息以计算预期支持。为了减少所需的空间量，算法会在计算或近似预期支持时捕获部分而非全部信息。折衷方案是预期支撑的上限可能并不紧。在本文中，我们研究了几个上限，并向用户推荐了哪些上限占用更少的空间，同时为挖掘不确定性数据的概率集中的频繁项集提供了良好的预期支持。

著录项

来源
《Procedia Computer Science》 |2015年第1期|共10页
作者
Alfredo Cuzzocrea; Carson K. Leung; Richard Kyle MacKinnon;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Approximation to Expected Support of Frequent Itemsets in Mining Probabilistic Sets of Uncertain Data [J] . Alfredo Cuzzocrea, Carson K. Leung, Richard Kyle MacKinnon Procedia Computer Science . 2015,第1期

机译：挖掘不确定数据概率集中的频繁项集的预期支持的近似值
2. Efficient weighted probabilistic frequent itemset mining in uncertain databases [J] . Li Zhiyang, Chen Fengjuan, Wu Junfeng, Expert Systems . 2021,第5期

机译：在不确定数据库中有效的加权概率频繁漏洞挖掘
3. Probabilistic maximal frequent itemset mining methods over uncertain databases [J] . Li Haifeng, Hai Mo, Zhang Ning, Intelligent data analysis . 2019,第6期

机译：概率最大频繁的项目集挖掘方法在不确定数据库中
4. Analyzing Expected Support-Based Frequent Itemsets over Uncertain Data [C] . Fengjuan Chen, Wenyu Qu, Zhiyang Li IEEE International Conference on High Performance Computing and Communications;IEEE International Conference on Smart City;IEEE International Conference on Data Science and Systems . 2018

机译：分析不确定数据上基于预期支持的频繁项目集
5. Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining. [D] . Hao, Boyu. 2010

机译：从不确定的数据中挖掘频繁项集：约束挖掘和流挖掘的扩展。
6. SiBIC: A Web Server for Generating Gene Set Networks Based on Biclusters Obtained by Maximal Frequent Itemset Mining [O] . Kei-ichiro Takahashi, Ichigaku Takigawa, Hiroshi Mamitsuka -1

机译：SiBIC：一种基于Biclusters的基因组网络生成Web服务器该Biclusters通过最大频繁项集挖掘获得
7. Approximation to expected support of frequent itemsets in mining probabilistic sets of uncertain data [O] . Cuzzocrea Alfredo, Leung Carson K., Mackinnon Richard Kyle 2015

机译：挖掘不确定数据的概率集中频繁项集的预期支持的近似值

Approximation to Expected Support of Frequent Itemsets in Mining Probabilistic Sets of Uncertain Data

摘要

著录项

相似文献

相关主题

期刊订阅