...
【24h】

HIGH UTILITY ITEMSETS MINING

机译:高实用性项目挖掘

获取原文
获取原文并翻译 | 示例

摘要

High utility itemsets mining identifies itemsets whose utility satisfies a given threshold. It allows users to quantify the usefulness or preferences of items using different values. Thus, it reflects the impact of different items. High utility itemsets mining is useful in decision-making process of many applications, such as retail marketing and Web service, since items are actually different in many aspects in real applications. However, due to the lack of "downward closure property", the cost of candidate generation of high utility itemsets mining is intolerable in terms of time and memory space. This paper presents a Two-Phase algorithm which can efficiently prune down the number of candidates and precisely obtain the complete set of high utility itemsets. The performance of our algorithm is evaluated by applying it to synthetic databases and two real-world applications. It performs very efficiently in terms of speed and memory cost on large databases composed of short transactions,which are difficult for existing high utility itemsets mining algorithms to handle. Experiments on real-world applications demonstrate the significance of high utility itemsets in business decision-making, as well as the difference between frequent itemsets and high utility itemsets.
机译:高实用性项目集挖掘可识别其实用性满足给定阈值的项目集。它允许用户使用不同的值来量化项目的有用性或偏好。因此,它反映了不同项目的影响。高实用性项目集挖掘在许多应用程序的决策过程中非常有用,例如零售营销和Web服务,因为项目在实际应用程序中实际上在许多方面都不同。但是,由于缺少“向下封闭属性”,因此在时间和存储空间方面,生成高实用性项目集的候选候选对象产生的成本是无法忍受的。本文提出了一种两阶段算法,该算法可以有效地减少候选者的数量,并精确地获得高实用项集的完整集合。通过将其应用于综合数据库和两个实际应用程序,可以评估我们算法的性能。在由短事务组成的大型数据库上,它在速度和内存成本方面非常高效,而现有的高实用性项目集挖掘算法难以处理。实际应用程序上的实验证明了高用途项目集在业务决策中的重要性,以及频繁项目集和高用途项目集之间的区别。

著录项

  • 来源
  • 作者单位

    School of Information Science and EngineeringGraduate University of Chinese Academy of SciencesResearch Center on Fictitious Economy and Data ScienceChinese Academy of Sciences, 80 ZhongGuanCun East RoadBeijing 100190, ChinaBloomberg L.P., USA731 Lexington Avenue, New YorkNY 10022, USAResearch Center on Fictitious Economy and Data ScienceChinese Academy of SciencesHaidian District, Beijing 100190, Chinayshi@gucas.ac.cnandCollege of Information Science and TechnologyUniversity of Nebraska at OmahaOmaha, NE 68182-0392, USADepartment of Electrical Engineering and Computer ScienceNorthwestern University, Evanston, IL 60208, USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Data mining; utility mining; business intelligence.;

    机译:数据挖掘;公用事业采矿;商业智能。;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号