首页> 外文会议>International conference on data warehousing and knowledge discovery >SSCJ: A Semi-Stream Cache Join Using a Front-Stage Cache Module
【24h】

SSCJ: A Semi-Stream Cache Join Using a Front-Stage Cache Module

机译:SSCJ:使用前级缓存模块的半流缓存联接

获取原文

摘要

Semi-stream processing has become an emerging area of research in the field of data stream management. One common operation in semi-stream processing is joining a stream with disk-based master data using a join operator. This join operator typically works under limited main memory and this memory is generally not large enough to hold the whole disk-based master data. Recently, a number of semi-stream join algorithms have been proposed in the literature to achieve an optimal performance but still there is room to improve the performance. In this paper we propose a novel Semi-Stream Cache Join (SSCJ) using a front-stage cache module. The algorithm takes advantage of skewed distributions, and we present results for Zipfian distributions of the type that appear in many applications. We analyze the performance of SSCJ with a well known related join algorithm, HYBRIDJOIN (Hybrid Join). We also provide the cost model for our approach and validate it with experiments.
机译:半流处理已成为数据流管理领域中一个新兴的研究领域。半流处理中的一种常见操作是使用联接运算符将流与基于磁盘的主数据联接。此联接运算符通常在有限的主内存下工作,并且此内存通常不足以容纳整个基于磁盘的主数据。近来,在文献中已经提出了许多半流联接算法以实现最佳性能,但是仍然存在改进性能的空间。在本文中,我们提出了一种使用前级缓存模块的新型半流缓存联接(SSCJ)。该算法利用了偏态分布的优势,我们给出了在许多应用中出现的那种类型的Zipfian分布的结果。我们使用一种众所周知的相关联接算法HYBRIDJOIN(混合联接)来分析SSCJ的性能。我们还为我们的方法提供了成本模型,并通过实验对其进行了验证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号