首页> 外文会议>Pacific-Asia Conference on Knowledge Discovery and Data Mining >OCEAN: Fast Discovery of High Utility Occupancy Itemsets
【24h】

OCEAN: Fast Discovery of High Utility Occupancy Itemsets

机译:海洋:快速发现高实用占用符集

获取原文

摘要

Frequent pattern mining has been widely studied in the past decades and has been applied to many domains. In particular, numerical transaction databases, where not only the items but also the utility associated with them are available in user transactions, are useful for real applications. For example, customer mobile App traffic data collected by mobile service providers contains such information. In this paper, we aim to find frequent patterns that occupy a large portion of total utility of the supporting transactions, to answer questions like "On which mobile Apps do the customers spend most of their data traffic?" Towards this goal, we define a measure called utility occupancy to measure the contribution of a pattern within a transaction. The challenge of high utility occupancy itemset discovering is the lack of monotone or anti-monotone property. So we derive an upper bound for utility occupancy and design an efficient mining algorithm called OCEAN based on a fast implementation of utility list. Evaluations on real world mobile App traffic data and other three datasets show that OCEAN is efficient and effective in finding frequent patterns with large utility occupancy.
机译:在过去的几十年中,频繁的模式挖掘已被广泛研究,并已应用于许多领域。特别是,数字事务数据库,不仅可以在用户事务中提供与它们关联的项目,而且在用户交易中可用,对真实应用有用。例如,移动服务提供商收集的客户移动应用程序流量数据包含此类信息。在本文中,我们的目标是找到频繁的模式,占据支持交易的大量实用程序的模式,以回答“客户在哪些移动应用程序在其大部分数据流量上?”这样的问题为了实现这一目标,我们定义了一种名为实用程序占用的度量,以衡量交易中的模式的贡献。高效占用术例发现发现的挑战是缺乏单调或抗单调性。因此,我们派生实用程序占用的上限,并根据实用程序列表的快速实现,设计了一个称为Ocean的高效采矿算法。对现实世界移动应用程序的评估和其他三个数据集显示,海洋在找到具有大型公用事业占用的频繁模式方面是高效且有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号