首页> 外文会议>32nd International Conference on Very Large Data Bases(VLDB 2006) vol.2 >Providing Resiliency to Load Variations in Distributed Stream Processing
【24h】

Providing Resiliency to Load Variations in Distributed Stream Processing

机译:为分布式流处理中的负载变化提供弹性

获取原文
获取原文并翻译 | 示例

摘要

Scalability in stream processing systems can be achieved by using a cluster of computing devices. The processing burden can, thus, be distributed among the nodes by partitioning the query graph. The specific operator placement plan can have a huge impact on performance. Previous work has focused on how to move query operators dynamically in reaction to load changes in order to keep the load balanced. Operator movement is too expensive to alleviate short-term bursts; moreover, some systems do not support the ability to move operators dynamically. In this paper, we develop algorithms for selecting an operator placement plan that is resilient to changes in load. In other words, we assume that operators cannot move, therefore, we try to place them in such a way that the resulting system will be able to withstand the largest set of input rate combinations. We call this a resilient placement.This paper first formalizes the problem for operators that exhibit linear load characteristics (e.g., filter, aggregate), and introduces a resilient placement algorithm. We then show how we can extend our algorithm to take advantage of additional workload information (such as known minimum input stream rates). We further show how this approach can be extended to operators that exhibit non-linear load characteristics (e.g., join). Finally, we present prototype- and simulation-based experiments that quantify the benefits of our approach over existing techniques using real network traffic traces.
机译:流处理系统中的可伸缩性可以通过使用一组计算设备来实现。因此,通过划分查询图,可以在节点之间分配处理负担。具体的操作员安置计划可能会对性能产生巨大影响。先前的工作集中在如何动态响应查询操作符以响应负载变化以保持负载平衡。操作员移动过于昂贵,无法减轻短期突发事件。此外,某些系统不支持动态移动操作员的功能。在本文中,我们开发了用于选择对负载变化具有弹性的操作员布置计划的算法。换句话说,我们假设操作员无法移动,因此,我们尝试将其放置为使所得的系统能够承受最大的输入速率组合。我们将其称为弹性放置。本文首先将表现出线性负载特征(例如过滤器,集合)的操作员的问题形式化,并介绍了弹性放置算法。然后,我们展示了如何扩展算法以利用其他工作负载信息(例如已知的最小输入流速率)。我们进一步展示了如何将该方法扩展到具有非线性负载特征(例如,联接)的算子。最后,我们介绍了基于原型和基于仿真的实验,这些实验使用真实的网络流量跟踪来量化我们的方法相对于现有技术的收益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号