Venue: IEEE International Conference on Software Engineering and Service Science

A Scheduling System for Big Data Hybrid Computing Workflow

Abstract

As big data sees wider use, big data technologies have become more diverse, and solving a particular problem often involves many different types of big data tasks. How to schedule these different task types together (hybrid scheduling) is an urgent problem. Previously, the industry used crontab to run big data tasks on a regular schedule. Crontab conveniently handles system and user task scheduling in a Linux environment, but it cannot meet the scheduling needs of complex business scenarios, and it requires users to write their own submission logic. This paper therefore designs a hybrid scheduling system for big data tasks based on Airflow. The system supports assembling different types of big data tasks into a workflow and scheduling those tasks according to that workflow. The scheduling module is also independent of the other modules, which reduces coupling between modules. The proposed method has been applied to a big data platform, and its effectiveness has been verified.
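
The idea of building different task types into one workflow and scheduling them by dependency can be illustrated with a minimal sketch. It is not the paper's implementation and does not use Airflow's API. It relies only on the Python standard library (`graphlib`, Python 3.9+). The names `Task`, `SUBMITTERS`, and `run_workflow` are hypothetical, and the submitters return description strings rather than launching real jobs.

```python
from dataclasses import dataclass, field
from graphlib import TopologicalSorter
from typing import Callable, Dict, List


@dataclass
class Task:
    """One workflow node; `kind` is a hypothetical label such as 'spark' or 'hive'."""
    name: str
    kind: str
    upstream: List[str] = field(default_factory=list)


# Hypothetical per-type submitters. A real system would launch the job;
# each one here only returns a string describing what would be submitted.
SUBMITTERS: Dict[str, Callable[[Task], str]] = {
    "shell": lambda t: f"run shell step {t.name}",
    "hive": lambda t: f"submit Hive query {t.name}",
    "spark": lambda t: f"submit Spark job {t.name}",
}


def run_workflow(tasks: List[Task]) -> List[str]:
    """Dispatch each task by its type, after all of its upstream tasks."""
    by_name = {t.name: t for t in tasks}
    # TopologicalSorter expects a mapping of node -> predecessors and
    # raises graphlib.CycleError if the dependencies contain a cycle.
    order = TopologicalSorter({t.name: t.upstream for t in tasks}).static_order()
    return [SUBMITTERS[by_name[name].kind](by_name[name]) for name in order]


if __name__ == "__main__":
    workflow = [
        Task("extract", "shell"),
        Task("clean", "hive", upstream=["extract"]),
        Task("train", "spark", upstream=["clean"]),
    ]
    for line in run_workflow(workflow):
        print(line)
```

Unlike a crontab entry, which only fires a command at a fixed time, the workflow here records which task depends on which. The per-type dispatch table stands in for the submission logic that crontab users would otherwise write themselves.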
