首页> 外文会议>IEEE International Conference on Data Science and Advanced Analytics >Mining actionable combined patterns of high utility and frequency
【24h】

Mining actionable combined patterns of high utility and frequency

机译:挖掘高实用性和高频率的可行组合模式

获取原文

摘要

In recent years, the importance of identifying actionable patterns has become increasingly recognized so that decision-support actions can be inspired by the resultant patterns. A typical shift is on identifying high utility rather than highly frequent patterns. Accordingly, High Utility Itemset (HUI) Mining methods have become quite popular as well as faster and more reliable than before. However, the current research focus has been on improving the efficiency while the coupling relationships between items are ignored. It is important to study item and itemset couplings inbuilt in the data. For example, the utility of one itemset might be lower than user-specified threshold until one additional itemset takes part in; and vice versa, an item's utility might be high until another one joins in. In this way, even though some absolutely high utility itemsets can be discovered, sometimes it is easily to find out that quite a lot of redundant itemsets sharing the same item are mined (e.g., if the utility of a diamond is high enough, all its supersets are proved to be HUIs). Such itemsets are not actionable, and sellers cannot make higher profit if marketing strategies are created on top of such findings. To this end, here we introduce a new framework for mining actionable high utility association rules, called Combined Utility-Association Rules (CUAR), which aims to find high utility and strong association of itemset combinations incorporating item/itemset relations. The algorithm is proved to be efficient per experimental outcomes on both real and synthetic datasets.
机译:近年来,识别可行模式的重要性已得到越来越多的认识,因此,决策支持措施可以从最终的模式中得到启发。一个典型的转变是确定高实用性而不是频繁使用的模式。因此,高实用项集(HUI)挖掘方法已经变得非常流行,并且比以前更快,更可靠。但是,当前的研究重点一直放在提高效率上,而忽略了项目之间的耦合关系。研究数据中内置的项目和项目集耦合很重要。例如,一个项目集的效用可能低于用户指定的阈值,直到另外一个项目集参与为止。反之亦然,一个项目的效用可能很高,直到另一个项目加入为止。这样,即使可以发现一些绝对高效用的项目集,有时也很容易发现共享同一项目的大量冗余项目集是开采(例如,如果钻石的效用足够高,则证明其所有超集都是HUI)。这样的项目集是不可行的,并且如果在这样的发现之上创建营销策略,卖方将无法获得更高的利润。为此,我们在此引入了一种新的框架,用于挖掘可操作的高效用关联规则,称为组合效用关联规则(CUAR),该框架旨在查找合并了项/项集关系的项集组合具有高效用和强关联性。实践证明,该算法在真实数据集和合成数据集上均能有效地满足实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号