Journal: Engineering Applications of Artificial Intelligence

An indexed set representation based multi-objective evolutionary approach for mining diversified top-k high utility patterns



Abstract

Mining top-k high utility patterns, that is, discovering the k patterns with the largest utility values, is an active topic in data mining. However, most existing approaches evaluate each pattern separately during mining, so many of the mined patterns are highly similar and the result lacks diversity. In this paper, we propose mining top-k high utility patterns with high diversity to improve user satisfaction in recommendation. Specifically, we first introduce a simple coverage measure that quantifies the diversity of the top-k patterns as a whole set rather than pattern by pattern. Because utility and coverage are conflicting measures, we then propose an indexed set representation based multi-objective evolutionary approach, named ISR-MOEA, to mine diversified top-k high utility patterns. In ISR-MOEA, an indexed set individual representation scheme is proposed for fast encoding and decoding of the top-k pattern set. Experimental results on six real-world and two synthetic datasets demonstrate the effectiveness of the proposed approach. In a single run, it obtains several top-k pattern sets with different trade-offs between utility and diversity, which can further improve user satisfaction.
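The conflict between utility and coverage described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not define coverage or the encoding in detail, so the sketch assumes that an individual is a set of k indices into a list of candidate patterns, and that coverage is the fraction of distinct items appearing in at least one selected pattern. The names `decode`, `objectives`, `dominates`, and all toy data are hypothetical.

```python
from typing import FrozenSet, List, Sequence, Tuple

Pattern = FrozenSet[str]


def decode(individual: Sequence[int], candidates: Sequence[Pattern]) -> List[Pattern]:
    """Map an index set back to the patterns it selects (assumed encoding)."""
    return [candidates[i] for i in sorted(set(individual))]


def objectives(
    individual: Sequence[int],
    candidates: Sequence[Pattern],
    utilities: Sequence[float],
    all_items: FrozenSet[str],
) -> Tuple[float, float]:
    """Return (total utility, coverage) of the selected pattern set.

    Coverage here is an ASSUMED definition: the share of all items that
    occur in at least one selected pattern. all_items must be non-empty.
    """
    chosen = sorted(set(individual))
    total_utility = sum(utilities[i] for i in chosen)
    covered = set().union(*(candidates[i] for i in chosen))
    coverage = len(covered & all_items) / len(all_items)
    return total_utility, coverage


def dominates(a: Tuple[float, float], b: Tuple[float, float]) -> bool:
    """Pareto dominance when both objectives are maximized."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


# Hypothetical toy data: four candidate patterns with precomputed utilities.
candidates = [frozenset("ab"), frozenset("abc"), frozenset("de"), frozenset("f")]
utilities = [50.0, 48.0, 30.0, 10.0]
all_items = frozenset("abcdef")

similar = (0, 1)  # the two highest-utility patterns, which overlap heavily
diverse = (0, 2)  # lower total utility, but the patterns share no items

f_similar = objectives(similar, candidates, utilities, all_items)
f_diverse = objectives(diverse, candidates, utilities, all_items)
```

In this toy case the `similar` set has the higher utility and the `diverse` set has the higher coverage, so neither dominates the other. A multi-objective evolutionary algorithm would keep both as alternative trade-offs, which matches the abstract's claim of returning several top-k sets in one run.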
