首页> 外文期刊>Journal of Parallel and Distributed Computing >Model-driven scheduling for distributed stream processing systems
【24h】

Model-driven scheduling for distributed stream processing systems

机译:分布式流处理系统的模型驱动调度

获取原文
获取原文并翻译 | 示例
           

摘要

Distributed Stream Processing Systems (DSPS) are “Fast Data” platforms that allow streaming applications to be composed and executed with low latency on commodity clusters and Clouds. Such applications are composed as a Directed Acyclic Graph (DAG) of tasks, with data parallel execution using concurrent task threads on distributed resource slots. Scheduling such DAGs for DSPS has two parts—allocationof threads and resources for a DAG, andmappingthreads to resources. Existing schedulers often address just one of these, make the assumption that performance linearly scales, or usead hocempirical tuning at runtime. Instead, we propose model-driven techniques for both mapping and allocation that rely on low-overheada prioriperformance modeling of tasks. Our scheduling algorithms are able to offer predictable and low resource needs that is suitable for elastic pay-as-you-go Cloud resources, support a high input rate through high VM utilization, and can be combined with other mapping approaches as well. These are validated for micro and application benchmarks, and compared with contemporary schedulers, for the Apache Storm DSPS.
机译:分布式流处理系统(DSPS)是“快速数据”平台,允许在商品集群和云上以低延迟的方式编写和执行流应用程序。此类应用程序由任务的有向无环图(DAG)组成,使用分布式资源插槽上的并发任务线程并行执行数据。为DSPS安排此类DAG包括两个部分:为DAG分配线程和资源,以及将线程映射到资源。现有的调度程序通常仅解决其中之一,并假设性能线性扩展,或者在运行时使用临时经验调整。相反,我们提出了依赖于任务的低开销先验性能建模的模型驱动技术进行映射和分配。我们的调度算法能够提供可预测的低资源需求,适用于弹性即付即用的云资源,通过高VM使用率支持高输入率,并且还可以与其他映射方法结合使用。这些已针对微型和应用程序基准进行了验证,并与现代调度程序进行了比较,以用于Apache Storm DSPS。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号