首页> 外文会议>International conference of Electronics, Communication and Aerospace Technology >Mining high utility itemsets using TKO and TKU to find top-k high utility web access patterns
【24h】

Mining high utility itemsets using TKO and TKU to find top-k high utility web access patterns

机译:使用TKO和TKU挖掘高实用性项目集以找到top-k高实用性Web访问模式

获取原文

摘要

Mining high utility itemsets from a transactional database refers to the discovery of itemsets with high utility like profits. Although a number of relevant approaches have been proposed in recent years, but they incur the problem of producing a large number of candidate itemsets for high utility itemsets. Such a large number of candidate itemsets degrades the mining performance in terms of execution time and space requirement. The situation may become worse when the database contains lots of long transactions or long high utility itemsets. An emerging topic in the field of data mining is utility mining which not only considers the frequency of the itemsets but also considers the utility associated with the itemsets. The main objective of High Utility Itemset Mining is to identify itemsets that have utility values above a given utility threshold. Thus Utility mining plays an important role in many real-time applications and is an important research topic in data mining system to find the itemsets with high profit. In this paper we present the implementation of first module where pre-processing of dataset is done to remove unpromising data from web usage and product base dataset by using TopKRules and also we are proposing a new framework for Top-k high utility web access patterns, where k is the desired number of HUIs to be mined. Two types of efficient algorithms named TKU and TKO are proposed for mining such itemsets. In this paper we present a literature review of the present state of research and the various algorithms for high utility itemset mining.
机译:从交易数据库中挖掘高实用性项目集是指发现具有高实用性(如利润)的项目集。尽管近年来已经提出了许多相关方法,但是它们引起了为高实用性项目集产生大量候选项目集的问题。如此大量的候选项集会降低执行时间和空间要求的挖掘性能。当数据库包含大量的长事务或长时间的高实用项集时,情况可能会变得更糟。数据挖掘领域的一个新兴主题是实用程序挖掘,它不仅考虑项目集的频率,而且考虑与项目集关联的效用。高实用性项目集挖掘的主要目标是识别实用价值高于给定实用性阈值的项目集。因此,实用程序挖掘在许多实时应用中扮演着重要角色,并且是数据挖掘系统中寻找高利润项目集的重要研究课题。在本文中,我们介绍了第一个模块的实现,在该模块中,通过使用TopKRules对数据集进行了预处理,以从Web使用量和产品基础数据集中删除无用的数据,并且我们还为Top-k高实用性Web访问模式提出了一个新框架,其中k是要开采的HUI的期望数量。为挖掘此类项目集,提出了两种有效的算法,分别称为TKU和TKO。在本文中,我们对当前的研究现状以及用于高效项集挖掘的各种算法进行了文献综述。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号