IEEE International Conference on Cluster Computing

Disk cache-aware task scheduling for data-intensive and many-task workflow

Abstract

Workflow scheduling to maximize I/O performance is one of the key issues in data-intensive, many-task computing. In our previous work, we proposed a locality-aware workflow scheduling method using Multi-Constraint Graph Partitioning. In this work, we focus on the read performance of input files from the disk cache (the buffer cache or page cache in main memory). To maximize the disk cache hit rate for input files, LIFO-order scheduling is effective, since newly created intermediate files are likely to be read soon. However, the LIFO policy suffers from the so-called “trailing task problem.” We propose a hybrid scheduling strategy that combines LIFO and HRF (Highest Rank First). In our strategy, one of the two policies is applied depending on the number of highest-rank tasks in the queue, which avoids the problem. In addition, we propose scheduling that overlaps computation and I/O. We implement our scheduling strategy for the Pwrake workflow system and the Gfarm distributed file system and evaluate it by executing data-intensive workflows on a computer cluster. Our scheduling strategy improves the performance of the copyfile workflow by 30% due to an increase in the disk cache hit rate, and the performance of the Montage workflow by 12% due to an increase in core utilization.
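The hybrid LIFO/HRF selection described in the abstract can be illustrated with a minimal Python sketch. The abstract does not state the switching rule, so the rule used here is an assumption: HRF is applied when fewer than `threshold` queued tasks share the highest rank, and LIFO otherwise. The names `Task`, `pick_next`, and `threshold` are hypothetical, as is the convention that a larger rank means a longer remaining path to the end of the workflow. This is not the Pwrake implementation.

```python
from dataclasses import dataclass


@dataclass
class Task:
    name: str
    rank: int  # assumed: larger rank = longer remaining path to workflow end


def pick_next(queue, threshold):
    """Remove and return the next task from `queue` (oldest first, newest last).

    Assumed rule (not taken from the abstract): if fewer than `threshold`
    tasks share the highest rank, use HRF so that those tasks are not left
    trailing at the end of the run; otherwise use LIFO.
    """
    if not queue:
        raise IndexError("pick_next from an empty queue")
    top_rank = max(t.rank for t in queue)
    n_top = sum(1 for t in queue if t.rank == top_rank)
    if n_top < threshold:
        # HRF: take the most recently queued task among the highest-rank ones.
        for i in range(len(queue) - 1, -1, -1):
            if queue[i].rank == top_rank:
                return queue.pop(i)
    # LIFO: take the most recently queued task, whose input files are
    # the most likely to still be in the disk cache.
    return queue.pop()


queue = [Task("long_chain", 3), Task("leaf_1", 1), Task("leaf_2", 1)]
print(pick_next(queue, threshold=2).name)  # long_chain
```

In this sketch a natural choice for `threshold` would be the number of cores, but that is also an assumption; the abstract only says that the policy depends on the number of highest-rank tasks in the queue.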
