...
首页> 外文期刊>International Journal of Fuzzy Systems >Efficiently Updating the Discovered Multiple Fuzzy Frequent Itemsets with Transaction Insertion
【24h】

Efficiently Updating the Discovered Multiple Fuzzy Frequent Itemsets with Transaction Insertion

机译:通过事务插入有效地更新发现的多个模糊频繁项集

获取原文
获取原文并翻译 | 示例
           

摘要

Most pattern mining approaches such as association rule mining or frequent itemsets mining can only handle, however, the binary database in which each item or attribute is represented as 0 or 1. In real-life situations, data may be represented as more complicatedly. Moreover, it is more meaningful for human begins to understand if the discovered rules can be represented as the linguistic terms. Fuzzy-set theory was adopted to handle the quantitative database, which transforms the quantity value into the linguistic terms with their fuzzy values based on the predefined membership functions. Several algorithms were designed to handle this issue, but most of them focus on mining the fuzzy rules in the static database. When the database size is changed, the whole updated database should be rescanned again to obtain the up-to-date information. This progress ignores the already discovered knowledge with its computational cost, which is inefficient in dynamic applications. In this paper, we present an incremental multiple fuzzy frequent itemset mining with transaction INSertion (IMF-INS) algorithm to efficiently update the multiple fuzzy frequent itemsets from the quantitative dataset. The designed IMF-INS utilizes the fuzzy FUP concept to divide the transformed linguistic terms into four cases. Each case is performed by the designed approach to update the discovered information. Also, the fuzzy list structure is utilized to reduce the generation of candidates without multiple database scans. Experiments are conducted to show the performance of the proposed algorithm in terms of runtime, memory consumption, number of determined patterns and scalability. Results indicated that the designed IMF-INS outperforms the state-of-the-art batch mining progresses.
机译:大多数模式挖掘方法(例如关联规则挖掘或频繁项集挖掘)只能处理,但是,其中每个项或属性都表示为0或1的二进制数据库。在现实情况下,数据可能表示得更复杂。而且,对于人类开始理解所发现的规则是否可以表示为语言术语而言,更有意义。采用模糊集理论处理定量数据库,该数据库基于预定义的隶属度函数将数量值及其模糊值转换成语言术语。设计了几种算法来处理此问题,但大多数算法集中在挖掘静态数据库中的模糊规则。更改数据库大小后,应再次重新扫描整个更新的数据库以获得最新信息。这一进展忽略了已发现的知识及其计算成本,这在动态应用程序中效率低下。在本文中,我们提出了一种使用事务插入(IMF-INS)算法进行增量式模糊频繁项集挖掘,以从定量数据集中有效更新多个模糊频繁项集。设计的IMF-INS使用模糊FUP概念将转换后的语言术语分为四种情况。通过设计方法执行每种情况以更新发现的信息。同样,模糊列表结构用于减少候选对象的生成,而无需进行多次数据库扫描。进行实验以在运行时间,内存消耗,确定的模式数量和可伸缩性方面显示所提出算法的性能。结果表明,设计的IMF-INS优于最新的批量开采进度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号