首页> 外文会议>Pacific-Asia conference on knowledge discovery and data mining >OCEAN: Fast Discovery of High Utility Occupancy Itemsets
【24h】

OCEAN: Fast Discovery of High Utility Occupancy Itemsets

机译:海洋:快速发现高级公用事业项目集

获取原文

摘要

Frequent pattern mining has been widely studied in the past decades and has been applied to many domains. In particular, numerical transaction databases, where not only the items but also the utility associated with them are available in user transactions, are useful for real applications. For example, customer mobile App traffic data collected by mobile service providers contains such information. In this paper, we aim to find frequent patterns that occupy a large portion of total utility of the supporting transactions, to answer questions like "On which mobile Apps do the customers spend most of their data traffic?" Towards this goal, we define a measure called utility occupancy to measure the contribution of a pattern within a transaction. The challenge of high utility occupancy itemset discovering is the lack of monotone or anti-monotone property. So we derive an upper bound for utility occupancy and design an efficient mining algorithm called OCEAN based on a fast implementation of utility list. Evaluations on real world mobile App traffic data and other three datasets show that OCEAN is efficient and effective in finding frequent patterns with large utility occupancy.
机译:过去数十年来,频繁模式挖掘已得到广泛研究,并已应用于许多领域。特别是,数字交易数据库对于实际应用程序非常有用,在该数据库中,不仅项目而且与项目相关联的实用程序都可以在用户交易中使用。例如,移动服务提供商收集的客户移动App流量数据包含此类信息。在本文中,我们旨在找到频繁的模式,这些模式占据了支持交易的总效用的很大一部分,以回答诸如“客户将大部分数据流量花费在哪个移动应用程序上?”之类的问题。为了实现这一目标,我们定义了一种称为效用占用的度量,以度量交易中某种模式的贡献。高实用性占用项集发现的挑战是缺少单调或反单调属性。因此,我们得出了实用程序占用率的上限,并基于实用程序列表的快速实现设计了一种称为OCEAN的有效挖掘算法。对现实世界中移动App流量数据和其他三个数据集的评估表明,OCEAN在查找大型公用事业占用频繁的模式方面非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号