首页> 外文会议>International conference on artificial intelligence in China >Task Scheduling Strategy for Heterogeneous Spark Clusters
【24h】

Task Scheduling Strategy for Heterogeneous Spark Clusters

机译:异构火花集群的任务调度策略

获取原文

摘要

As a primary data processing and computing framework, Spark can support memory computing, interactive computing, and querying in a huge amount of data. Also, it can provide data mining, machine learning, stream computing, and the other services. However, the strategy of allocating resources among isomorphic processors cannot adapt to heterogeneous cluster environment due to its lack of load-based task scheduling. Therefore, we propose a dynamic load scheduling algorithm for heterogeneous Spark clusters by regularly collecting load information from each of the cluster node. Such an algorithm can dramatically reduce the allocation of load to the nodes which are already heavily loaded and in turn allocate more task to the idle nodes, thereby speeding up the process of job allocation in Spark. The experimental results show that the proposed algorithm can dramatically improve the computation efficiency by dynamically loading among the nodes in a heterogeneous cluster.
机译:作为主要数据处理和计算框架,Spark可以支持存储器计算,交互式计算和查询大量数据。此外,它可以提供数据挖掘,机器学习,流计算和其他服务。然而,由于其缺乏基于负载的任务调度,分配相同处理器之间的资源的策略不能适应异构的集群环境。因此,我们通过定期收集来自每个簇节点的负载信息来提出一种动态负载调度算法。这样的算法可以显着降低对已经大量加载的节点的负载分配,并且依次将更多任务分配给空闲节点,从而加速了火花的作业分配过程。实验结果表明,通过在异构群集中的节点之间动态加载,该算法可以大大提高计算效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号