...
首页> 外文期刊>The Journal of Systems and Software >Cost-efficient dynamic scheduling of big data applications in apache spark on cloud
【24h】

Cost-efficient dynamic scheduling of big data applications in apache spark on cloud

机译:云中基于Apache Spark的经济高效的大数据应用动态调度

获取原文
获取原文并翻译 | 示例
           

摘要

Job scheduling is one of the most crucial components in managing resources, and efficient execution of big data applications. Specifically, scheduling jobs in a cloud-deployed cluster are challenging as the cloud offers different types of Virtual Machines (VMs) and jobs can be heterogeneous. The default big data processing framework schedulers fail to reduce the cost of VM usages in the cloud environment while satisfying the performance constraints of each job. The existing works in cluster scheduling mainly focus on improving job performance and do not leverage from VM types on the cloud to reduce cost. In this paper, we propose efficient scheduling algorithms that reduce the cost of resource usage in a cloud-deployed Apache Spark cluster. In addition, the proposed algorithms can also prioritise jobs based on their given deadlines. Besides, the proposed scheduling algorithms are online and adaptive to cluster changes. We have also implemented the proposed algorithms on top of Apache Mesos. Furthermore, we have performed extensive experiments on real datasets and compared to the existing schedulers to showcase the superiority of our proposed algorithms. The results indicate that our algorithms can reduce resource usage cost up to 34% under different workloads and improve job performance.
机译:作业调度是管理资源以及有效执行大数据应用程序中最关键的组件之一。具体来说,在部署了云的群集中调度作业具有挑战性,因为云提供了不同类型的虚拟机(VM),并且作业可能是异构的。默认的大数据处理框架计划程序无法在满足每个作业的性能约束的同时降低云环境中VM使用的成本。集群调度中的现有工作主要集中于提高作业性能,而不是利用云上的VM类型来降低成本。在本文中,我们提出了有效的调度算法,可以减少在云部署的Apache Spark集群中的资源使用成本。另外,所提出的算法还可以根据给定的期限对作业进行优先级排序。此外,所提出的调度算法是在线的并且适应集群变化。我们还在Apache Mesos之上实现了建议的算法。此外,我们对真实数据集进行了广泛的实验,并与现有调度程序进行了比较,以展示我们提出的算法的优越性。结果表明,在不同的工作负载下,我们的算法可以将资源使用成本降低多达34%,并提高了工作绩效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号