首页> 外文学位 >Sampling: An efficient solution for data mining of association rules.
【24h】

Sampling: An efficient solution for data mining of association rules.

机译:采样:一种有效的关联规则数据挖掘解决方案。

获取原文
获取原文并翻译 | 示例

摘要

A classic problem in data mining is to find association rules between items in a large dataset of transactions, where a transaction is a subset of related items. For example, the items of a transaction might have been purchased at the same time. An association rule predicts the likelihood of an item appearing in a transaction at the same time with other items. The first step in finding association rules is discovering frequent itemsets.; In this thesis, we explore sampling techniques that can be used to find frequent itemsets. In particular, we compare the following three sampling algorithms: Simple Random Sampling with Replacement (SRSWR), Finding Associations from Sampled Transactions (FAST), and Finding Associations from Sampled Transaction Randomly (FASTRan). The first two algorithms, SRSWR and FAST, are previously known, whereas FASTRan is a new algorithm that we obtained by modifying FAST. Our experiments show that FAST and FASTRan produce significantly more accurate results, and moreover, our modified algorithm FASTRan has a slightly better performance than FAST.
机译:数据挖掘中的经典问题是在大型交易数据集中找到项目之间的关联规则,其中交易是相关项目的子集。例如,交易的项目可能已同时购买。关联规则可预测某项目与其他项目同时出现在交易中的可能性。查找关联规则的第一步是发现频繁项集。在本文中,我们探索了可用于查找频繁项集的采样技术。特别是,我们比较了以下三种采样算法:带替换的简单随机采样(SRSWR),从采样交易中找到关联(FAST)和从采样交易中随机找到关联(FASTRan)。前两个算法SRSWR和FAST是先前已知的,而FASTRan是我们通过修改FAST获得的新算法。我们的实验表明,FAST和FASTRan可以产生更为准确的结果,此外,我们的改进算法FASTRan的性能比FAST略好。

著录项

  • 作者

    Zhu, Jingbo.;

  • 作者单位

    Dalhousie University (Canada).;

  • 授予单位 Dalhousie University (Canada).;
  • 学科 Computer Science.
  • 学位 M.C.Sc.
  • 年度 2003
  • 页码 p.1762
  • 总页数 85
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号