首页> 外文期刊>International journal of computational intelligence research >CASW: Context Aware Sliding window for Frequent Itemset Mining over Data Streams
【24h】

CASW: Context Aware Sliding window for Frequent Itemset Mining over Data Streams

机译:CASW:用于数据流中频繁项集挖掘的上下文感知滑动窗口

获取原文
获取原文并翻译 | 示例
           

摘要

In recent years, advances in both hardware and software technologies coupled with high-speed data generation has led to data streams and data stream mining. Data generation has been much faster in data stream applications and scores of data is generated in quick turnaround time. Hence it becomes obvious to perform mining, data on arrival that is usually termed as data stream mining. General frequent pattern mining methods are envisaging limitations and do not support in responding to a massive quantum of data being streamed. In order to address such limitations, data mining researchers have focused on methods for conducting more efficient and effective mining tasks by scanning a database only once. As a process of evolution, sliding window model that perform mining operations focusing on updating accumulated parts over data streams, are proposed. It is hard to consider all of the frequent patterns in data stream environment as generated patterns were remarkably increasing as data streams get extended continuously. Hence, methods for efficiently compressing patterns that are generated are essential to address the limitations. Considering the challenges and shortcoming in the earlier solutions, in this paper, focus is on incremental mining of frequent patterns from the window and a solution of CASW (Context Aware Sliding Window) is proposed. There are well defined boundaries for frequent and infrequent patterns for specific patterns. In this research article, we adapt usage of window size change for representing conceptual drift in the information stream. An experimental study carried out on the model depicts significant developments and has affirmed that the algorithm has been designed with a more efficient system than that of existing solution.
机译:近年来,硬件和软件技术的进步以及高速数据生成已导致数据流和数据流挖掘。在数据流应用程序中,数据生成已快得多,并且在快速周转时间内即可生成大量数据。因此,进行挖掘很明显,通常将到达时的数据称为数据流挖掘。一般的频繁模式挖掘方法正在预见到局限性,并且不支持对正在传输的大量数据进行响应。为了解决这些限制,数据挖掘研究人员专注于通过仅扫描数据库一次来执行更有效的挖掘任务的方法。作为发展的过程,提出了执行挖掘操作的滑动窗口模型,该模型专注于更新数据流上的累积零件。很难考虑数据流环境中的所有频繁模式,因为随着数据流的不断扩展,生成的模式显着增加。因此,有效地压缩所产生的模式的方法对于解决限制是必不可少的。考虑到早期解决方案中的挑战和缺点,本文着重于从窗口中频繁挖掘频繁模式,并提出了一种CASW(上下文感知滑动窗口)解决方案。对于特定模式的频繁和不频繁模式,都有明确定义的边界。在这篇研究文章中,我们调整了窗口大小变化的用法,以表示信息流中的概念性漂移。在模型上进行的实验研究显示了重要的进展,并确认该算法的设计系统比现有解决方案更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号