首页> 外文OA文献 >Proactive Online Scheduling for Shuffle Grouping in Distributed Stream Processing Systems
【2h】

Proactive Online Scheduling for Shuffle Grouping in Distributed Stream Processing Systems

机译:分布式流处理系统中随机分组的主动在线调度

摘要

Shuffle grouping is a technique used by stream processing frameworks to share input load among parallel instances of stateless operators. With shuffle grouping each tuple of a stream can be assigned to any available operator instance, independently from any previous assignment. A common approach to implement shuffle grouping is to adopt a round robin policy, a simple solution that fares well as long as the tuple execution time is constant. However, such assumption rarely holds in real cases where execution time strongly depends on tuple content. As a consequence, parallel stateless operators within stream processing applications may experience unpredictable unbalance that, in the end, causes undesirable increase in tuple completion times. In this paper we propose Proactive Online Shuffle Grouping (POSG), a novel approach to shuffle grouping aimed at reducing the overall tuple completion time. POSG estimates the execution time of each tuple, enabling a proactive and online scheduling of input load to the target operator instances. Sketches are used to efficiently store the otherwise large amount of information required to schedule incoming load. We provide a probabilistic analysis and illustrate, through both simulations and a running prototype, its impact on stream processing applications.
机译:随机分组是流处理框架用于在无状态运算符的并行实例之间共享输入负载的技术。通过随机分组,可以将流的每个元组分配给任何可用的运算符实例,而与任何先前的分配无关。实现随机分组的一种常见方法是采用循环策略,这是一个简单的解决方案,只要元组执行时间是恒定的,它的效果就很好。但是,这种假设在执行时间强烈依赖于元组内容的实际情况中很少成立。结果,流处理应用程序中的并行无状态运算符可能会遇到不可预测的不平衡,最终导致元组完成时间的不期望增加。在本文中,我们提出了主动在线随机分组(POSG),这是一种新颖的随机分组方法,旨在减少整体元组完成时间。 POSG估计每个元组的执行时间,从而可以对目标操作员实例的输入负载进行主动和在线调度。草图用于有效地存储计划传入负载所需的其他大量信息。我们提供了概率分析,并通过模拟和运行中的原型说明了其对流处理应用程序的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号