首页> 外文会议>International Conference on Computer Engineering Systems >A Knowledge Management Framework for imbalanced data using Frequent Pattern Mining based on Bloom Filter
【24h】

A Knowledge Management Framework for imbalanced data using Frequent Pattern Mining based on Bloom Filter

机译:基于布隆过滤器的频繁模式挖掘的不平衡数据知识管理框架

获取原文

摘要

Managing medical environments and organizations performance depend directly on the knowledge management (KM) systems. Knowledge Discovery (KD) is responsible for digging information from datasets and finding internal knowledge within organizations or external sources. Data mining (DM) is the core of KD process. Although recent mining techniques have proven their accuracy in discovering the knowledge from balanced data, where the class distribution is balanced, the problem of discovering knowledge from unbalanced data is still a challenge that needs to be addressed. A Clustered Knowledge Management Framework (CKMD) is presented in this paper, for enhancing the performance of KD from unbalanced data. A Simple Hybrid Sampling Approach (SHSA) is proposed to reduce the adverse impacts of imbalanced data. Mining frequent pattern process plays an important role in KD process. Moreover, a Frequent Pattern Mining algorithm based on Bloom Filter (FPMBF) is proposed to discover items that frequently co-occur in the data using the bloom filter, that requires a single scan of the data, which leads to less time consuming in discovering knowledge for imbalanced data. Finally, the performance of the proposed methods is evaluated using real datasets and comparative experiments.
机译:管理医疗环境和组织绩效直接取决于知识管理(KM)系统。知识发现(KD)负责从数据集中挖掘信息,并在组织或外部来源中查找内部知识。数据挖掘(DM)是KD流程的核心。尽管最近的挖掘技术已经证明了其在平衡类分布平衡的平衡数据中发现知识的准确性,但是从不平衡数据中发现知识的问题仍然是需要解决的挑战。本文提出了一个集群知识管理框架(CKMD),用于增强来自不平衡数据的KD的性能。提出了一种简单的混合采样方法(SHSA),以减少不平衡数据的不利影响。频繁模式挖掘在KD过程中起着重要作用。此外,提出了一种基于布隆过滤器的频繁模式挖掘算法(FPMBF),以使用布隆过滤器发现频繁出现在数据中的项,这需要对数据进行一次扫描,从而减少了发现知识所需的时间用于不平衡的数据。最后,使用真实数据集和比较实验评估了所提出方法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号