首页> 外文会议>IEEE International Conference on Data Engineering >Quality-driven disorder handling for m-way sliding window stream joins
【24h】

Quality-driven disorder handling for m-way sliding window stream joins

机译:m路滑动窗流连接的质量驱动的异常处理

获取原文

摘要

Sliding window join is one of the most important operators for stream applications. To produce high quality join results, a stream processing system must deal with the ubiquitous disorder within input streams which is caused by network delay, parallel processing, etc. Disorder handling involves an inevitable tradeoff between the latency and the quality of produced join results. To meet different requirements of stream applications, it is desirable to provide a user-configurable result-latency vs. result-quality tradeoff. Existing disorder handling approaches either do not provide such configurability, or support only user-specified latency constraints. In this work, we advocate the idea of quality-driven disorder handling, and propose a buffer-based disorder handling approach for sliding window joins, which minimizes sizes of input-sorting buffers, thus the result latency, while respecting user-specified result-quality requirements. The core of our approach is an analytical model which directly captures the relationship between sizes of input buffers and the produced result quality. Our approach is generic. It supports m-way sliding window joins with arbitrary join conditions. Experiments on real-world and synthetic datasets show that, compared to the state of the art, our approach can reduce the result latency incurred by disorder handling by up to 95% while providing the same level of result quality.
机译:滑动窗口联接是流应用程序中最重要的运算符之一。为了产生高质量的连接结果,流处理系统必须处理由于网络延迟,并行处理等导致的输入流中的普遍存在的混乱。无序处理涉及在等待时间和产生的连接结果的质量之间不可避免的权衡。为了满足流应用的不同要求,期望提供用户可配置的结果等待时间与结果质量的权衡。现有的混乱处理方法或者不提供这种可配置性,或者仅支持用户指定的等待时间约束。在这项工作中,我们提倡质量驱动的无序处理的想法,并提出了一种用于滑动窗口连接的基于缓冲区的无序处理方法,该方法可最小化输入排序缓冲区的大小,从而最大程度地减少了结果等待时间,同时尊重用户指定的结果-质量要求。我们方法的核心是一个分析模型,该模型直接捕获输入缓冲区的大小与产生的结果质量之间的关系。我们的方法是通用的。它支持具有任意联接条件的m向滑动窗口联接。在现实世界和合成数据集上的实验表明,与现有技术相比,我们的方法可以将乱序处理导致的结果延迟降低多达95%,同时提供相同水平的结果质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号