首页> 外文会议>Intelligent Information and Database Systems, 2009. ACIIDS 2009 >Processing Exact Results for Sliding Window Joins over Time-Sequence, Streaming Data Using a Disk Archive
【24h】

Processing Exact Results for Sliding Window Joins over Time-Sequence, Streaming Data Using a Disk Archive

机译:处理按时间顺序滑动窗口联接的精确结果,使用磁盘存档流数据

获取原文

摘要

We consider the problem of processing exact results for sliding window joins over data streams with limited memory. Existing approaches deal with memory limitations by shedding loads, and therefore cannot provide exact or even highly accurate results for sliding window joins over data streams showing time varying rate of data arrivals. We provide an exact window join (EWJ) algorithm incorporating disk storage as an archive. Our algorithm spills window data onto the disk on a periodic basis, refines the output result by properly retrieving the disk resident data, and maximizes output rate by employing techniques to manage the memory blocks. The problem of managing the window blocks in memory-similar in nature to the caching issue-captures both the temporal and frequency related properties of the stream arrivals. At the same, we improve I/O efficiency by amortizing a disk scan over a large number of input tuple. We provide experimental results demonstrating the performance and effectiveness of the proposed algorithm.
机译:我们考虑在内存有限的数据流上处理滑动窗口联接的精确结果的问题。现有方法通过减少负载来处理内存限制,因此无法为显示数据到达率随时间变化的数据流上的滑动窗口联接提供精确甚至高精度的结果。我们提供了一个精确的窗口连接(EWJ)算法,该算法将磁盘存储作为存档。我们的算法会定期将窗口数据溢出到磁盘上,通过适当地检索磁盘驻留数据来优化输出结果,并通过采用管理内存块的技术来最大化输出速率。本质上类似于缓存的管理内存中的窗口块的问题捕获了流到达的时间和频率相关属性。同时,我们通过分摊大量输入元组的磁盘扫描来提高I / O效率。我们提供的实验结果证明了该算法的性能和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号