首页> 外文会议>IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing >Data Driven Priority Scheduling on a Spark Streaming System
【24h】

Data Driven Priority Scheduling on a Spark Streaming System

机译:Spark流系统上的数据驱动优先级调度

获取原文

摘要

Big data has become essential for businesses as it enables companies and organizations to gather insights from their data and use it to determine marketing opportunities, assist decision-making or even to find new business opportunities. Companies spend a great deal of effort collecting large amounts of data, which in some cases must be processed in real-time in order to capitalize on business opportunities. Predicting the expected input load at a given point in time can be very difficult and sometimes impossible. As a result, a great deal of effort is put into creating techniques to address varying input loads. A widely used approach is dynamic resource provisioning, but resource provisioners may not react in time to address the resource shortage which can result in increased processing latencies. This paper presents a priority scheduling technique that can be used in conjunction with dynamic and static resource provisioning. This approach allows users to assign a priority to input data items. The scheduler ensures that higher priority data items are given precedence over lower priority data items. This means that when resources become constrained the higher priority data items receive a greater share of resources and experience lower queueing delays in comparison to low priority items. A prototype for the data driven priority scheduler is implemented on the Spark Streaming system.
机译:大数据已成为企业必不可少的要素,因为它使公司和组织能够从其数据中收集见解,并利用它来确定营销机会,协助决策甚至寻找新的商机。公司花费大量精力收集大量数据,在某些情况下,必须实时处理这些数据,才能利用商机。在给定的时间点预测预期的输入负载可能非常困难,有时甚至是不可能的。结果,在创建技术上花费了大量的精力来解决变化的输入负载。一种广泛使用的方法是动态资源供应,但是资源供应商可能无法及时做出反应以解决资源短缺问题,这可能导致处理延迟增加。本文提出了一种优先级调度技术,该技术可与动态和静态资源供应结合使用。这种方法允许用户为输入数据项分配优先级。调度程序确保优先级较高的数据项优先于优先级较低的数据项。这意味着,当资源受到限制时,与低优先级项相比,较高优先级的数据项将获得更多的资源份额,并且排队等待的时间也较短。在Spark Streaming系统上实现了数据驱动优先级调度程序的原型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号