首页> 外文会议>IEEE Annual Computer Software and Applications Conference >A Market-Oriented Heuristic Algorithm for Scheduling Parallel Applications in Big Data Service Platform
【24h】

A Market-Oriented Heuristic Algorithm for Scheduling Parallel Applications in Big Data Service Platform

机译:面向市场的启发式大数据服务平台中并行应用调度算法

获取原文

摘要

Big Data analytics service platform delivers a new type of public cloud offerings, through which end users can outsource their job executions by using a group of professional Big Data processing services in a pay-per-use way. Different from other type of cloud services, parallel jobs dominate the domain of data processing services, whose execution time can be varied greatly with different runtime configurations, such as different degrees of parallelism. In such a market-oriented environment, scheduling jobs from end users efficiently to optimize the Big Data analytics service platform's revenue is a more challenging task. In this paper, we propose a market-oriented heuristic algorithm for scheduling parallel jobs in a Big Data analytics service platform with admission control to optimize the platform operator's revenue. The proposed scheduling heuristic takes into account not only the dynamic revenue gained from accomplishing a job within a specific runtime as well as the consumption of resources needed for running it to achieve this given runtime, but also the potential loss it causes to the system by running this job instead of other waiting jobs currently in the system. We also propose a collaborative filtering based approach to quickly and accurately predict the execution time of parallel jobs running in a Big Data analytics service platform. We have conducted extensive experiments and simulations based on workload data derived from the real-world data analytics service platform and parallel applications. We show that our scheduler can outperform the other scheduling algorithms used for comparison, which are based on classical heuristics from literature, thereby fully evaluating the effectiveness of our market-oriented heuristic scheduling algorithm.
机译:大数据分析服务平台提供了一种新型的公共云产品,最终用户可以通过按使用量付费使用一组专业的大数据处理服务来外包其工作执行。与其他类型的云服务不同,并行作业在数据处理服务的领域中占主导地位,其执行时间会随不同的运行时配置(例如不同的并行度)而大大不同。在这种面向市场的环境中,有效地调度最终用户的作业以优化大数据分析服务平台的收入是一项更具挑战性的任务。在本文中,我们提出了一种面向市场的启发式算法,用于在具有准入控制的大数据分析服务平台中调度并行作业,以优化平台运营商的收入。拟议的调度启发式方法不仅考虑到在特定运行时内完成任务所获得的动态收益,还考虑了为实现给定运行时而运行该工作所需的资源消耗,还考虑了运行时对系统造成的潜在损失该作业,而不是系统中当前正在等待的其他作业。我们还提出了一种基于协作过滤的方法,可以快速准确地预测在大数据分析服务平台中运行的并行作业的执行时间。我们已经根据来自真实数据分析服务平台和并行应用程序的工作负载数据进行了广泛的实验和模拟。我们展示了我们的调度程序可以胜过其他用于比较的调度算法,这些算法都是基于文献中的经典启发式算法,从而充分评估了我们面向市场的启发式调度算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号