【24h】

Rare association rule mining for data stream

机译:数据流的稀有关联规则挖掘

获取原文
获取原文并翻译 | 示例

摘要

The immense volumes of data is populated into repositories from various applications. More over data arrives into the repositories continuously i.e. stream of data that cannot be stored into repository due to its varying characteristics. Frequent itemset mining is thoroughly studied by many researchers but important rare items are not discovered by these algorithms. In many cases, the contradictions or exceptions also offers useful associations. In the recent past the researchers started to focus on the discovery of such kind of associations called rare associations. Rare itemsets can be obtained by setting low support but generates huge number of rules. The rare association rule mining is a challenging area of research on data streams. In this paper we proposed an idea to analyze the data stream to identify interesting rare association rules. Rare association rule mining is the process of identifying associations that are having low support but occurs with high confidence. The rare association rules are useful for many applications such as fraudulent credit card usage, anomaly detection in networks, detection of network failures, educational data, medical diagnosis etc. The proposed rare association rule mining algorithm for data stream is implemented using sliding window technique over a stream of data, data is represented in vertical bit sequence format. The advantage of proposed algorithm is that it requires single scan to discover all rare associations. The proposed algorithm outperforms both in terms of memory and time.
机译:大量的数据从各种应用程序填充到存储库中。更多的数据连续到达存储库,即由于其变化的特性而不能存储到存储库中的数据流。许多研究人员对频繁项集挖掘进行了深入研究,但这些算法并未发现重要的稀有项。在许多情况下,矛盾或例外也提供有用的关联。在最近的过去,研究人员开始专注于发现这种称为稀有关联的关联。可以通过设置较低的支持来获得稀有项目集,但是会生成大量规则。罕见的关联规则挖掘是数据流研究中一个充满挑战的领域。在本文中,我们提出了一种分析数据流以识别有趣的稀有关联规则的想法。稀有关联规则挖掘是识别具有低支持率但具有高置信度的关联的过程。稀有关联规则可用于许多应用,例如欺诈性的信用卡使用,网络中的异常检测,网络故障检测,教育数据,医疗诊断等。所提出的稀有关联规则挖掘算法是通过在数据流,数据以垂直位序列格式表示。提出的算法的优点是它需要单次扫描才能发现所有稀有关联。所提出的算法在内存和时间方面均优于。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号