Data-Locality Aware Dynamic Schedulers for Independent Tasks with Replicated Inputs

Abstract

In this paper we concentrate on a parameter that is crucial for efficiency in Big Data and HPC applications: data locality. We focus on scheduling a set of independent tasks, each depending on one input file. We assume that each input file has been replicated several times and placed in the local storage of different nodes of a cluster, similar to what is found in HDFS, for example. We consider two optimization problems, related to the two natural metrics: makespan optimization (under the constraint that only local tasks are allowed) and communication optimization (under the constraint that a processor is never left idle, in order to optimize the makespan). For both problems we investigate the performance of dynamic schedulers, in particular the basic greedy algorithm found, for example, in the default MapReduce scheduler. First, we study its performance theoretically, using probabilistic models, and provide a lower bound for the communication metric and the asymptotic behaviour for both metrics. Second, we propose simulations based on traces from a Hadoop cluster to compare the different dynamic schedulers and to assess the expected behaviour obtained from the theoretical study.
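The two problems in the abstract can be illustrated with a small simulation. The sketch below is not the authors' model or code: it assumes unit-length tasks, replicas placed uniformly at random on distinct processors, and random tie-breaking, and the names `place_replicas` and `greedy_schedule` are hypothetical. With `allow_remote=False` it mimics the makespan problem (only local tasks are allowed); with `allow_remote=True` it mimics the communication problem (no processor idles while tasks remain, and each non-local task counts as one communication).

```python
import random


def place_replicas(n_tasks, n_procs, replicas, rng):
    """Store each task's input file on `replicas` distinct processors (assumed uniform)."""
    return [set(rng.sample(range(n_procs), replicas)) for _ in range(n_tasks)]


def greedy_schedule(placement, n_procs, allow_remote, rng):
    """Simulate a greedy dynamic scheduler on unit-length tasks.

    At every time step each processor takes one unprocessed local task if it
    has one. Otherwise it takes a remote task when `allow_remote` is True,
    or stays idle when it is False. Returns (makespan, remote_task_count).
    Assumes every task has at least one replica, so the loop terminates.
    """
    remaining = set(range(len(placement)))
    # local[p] = tasks whose input file is stored on processor p
    local = [set() for _ in range(n_procs)]
    for task, holders in enumerate(placement):
        for p in holders:
            local[p].add(task)

    makespan = 0
    remote = 0
    while remaining:
        makespan += 1
        order = list(range(n_procs))
        rng.shuffle(order)  # random order among processors that are idle together
        for p in order:
            if not remaining:
                break
            candidates = local[p] & remaining
            if candidates:
                task = rng.choice(sorted(candidates))
            elif allow_remote:
                task = rng.choice(sorted(remaining))
                remote += 1  # the input file must be fetched from another node
            else:
                continue  # local-only variant: this processor idles
            remaining.discard(task)
    return makespan, remote


if __name__ == "__main__":
    rng = random.Random(0)
    # 40 tasks, 8 processors, 3 replicas per file (illustrative values)
    placement = place_replicas(40, 8, 3, rng)
    print("local only:", greedy_schedule(placement, 8, False, rng))
    print("with remote tasks:", greedy_schedule(placement, 8, True, rng))
```

Under these assumptions the remote-allowed variant always finishes in ceil(n/m) steps for n tasks on m processors, so its interesting output is the number of remote tasks. The local-only variant never communicates, so its interesting output is how far the makespan exceeds ceil(n/m).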


Bibliographic record

Source
IEEE International Parallel and Distributed Processing Symposium Workshops, 2018, pp. 1206-1213 (8 pages)
Authors
Olivier Beaumont; Thomas Lambert; Loris Marchal; Bastien Thomas
Original format
PDF
Keywords
Task analysis; Measurement; Dynamic scheduling; Distributed databases; Runtime; Big Data; Optimization


Similar literature


1. Performance analysis and optimality results for data-locality aware tasks scheduling with replicated inputs [J]. Olivier Beaumont, Thomas Lambert, Loris Marchal. Future Generation Computer Systems, 2020, Issue Octa
2. SLA-aware task scheduling and data replication for enhancing provider profit in clouds [J]. Amel Khelifa, Tarek Hamrouni, Riad Mokadem. Procedia Computer Science, 2020, Issue 5
3. Applying Dynamic Priority Scheduling Scheme to Static Systems of Pinwheel Task Model in Power-Aware Scheduling [J]. Ye-In Seol, Young-Kuk Kim. ScientificWorldJournal, 2014, Issue 3
4. Data-Locality Aware Dynamic Schedulers for Independent Tasks with Replicated Inputs [C]. Olivier Beaumont, Thomas Lambert, Loris Marchal. IEEE International Parallel and Distributed Processing Symposium Workshops, 2018
5. Configuration-aware and QoS-aware Task Scheduling in Real-Time Adaptive Embedded Systems [D]. Kooti, Hessam. 2012
6. Applying Dynamic Priority Scheduling Scheme to Static Systems of Pinwheel Task Model in Power-Aware Scheduling [O]. Ye-In Seol, Young-Kuk Kim
7. Data-Locality Aware Dynamic Schedulers for Independent Tasks with Replicated Inputs [O]. Olivier Beaumont, Thomas Lambert, Loris Marchal. 2018
