Performance Driven Multi-Objective Distributed Scheduling for Parallel Computations

Ankur Narang; Abhinav Srivastava; Naga Praveen Kumar Katta; Rudrapatna K. Shyamasundar

Abstract

With the advent of many-core architectures and the strong need for Petascale (and Exascale) performance in scientific domains and industry analytics, efficient scheduling of parallel computations for higher productivity and performance has become very important. Further, moving massive amounts of data (Terabytes to Petabytes) is very expensive, which necessitates affinity driven computations. Therefore, distributed scheduling of parallel computations on multiple places needs to optimize multiple performance objectives: follow affinity maximally and ensure efficient space, time and message complexity. Simultaneous consideration of these objectives makes distributed scheduling a particularly challenging problem. In addition, parallel computations have data dependent execution patterns, which require online scheduling to effectively optimize the computation orchestration as it unfolds. This paper presents an online algorithm for affinity driven distributed scheduling of multi-place parallel computations. To optimize multiple performance objectives simultaneously, our algorithm uses a low time and message complexity mechanism for ensuring affinity and a randomized work-stealing mechanism within places for load balancing. We provide a theoretical analysis of the expected and probabilistic lower and upper bounds on the time and message complexity of this algorithm. On multi-core clusters such as Blue Gene/P (MPP architecture) and an Intel multi-core cluster, we demonstrate performance close to that of custom MPI+Pthreads code. Further, we demonstrate strong, weak and data (increasing input data size) scalability on multi-core clusters. Using well known benchmarks, we demonstrate a 16% to 30% performance gain compared to Cilk [6] on the multi-core Intel Xeon 5570 (NUMA) architecture. Detailed experimental analysis illustrates efficient space (main memory) utilization as well.
To the best of our knowledge, this is the first time a multi-objective affinity driven distributed scheduling algorithm has been designed, theoretically analyzed and experimentally evaluated in a multi-place setup for multi-core cluster architectures.
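
The abstract names two mechanisms: tasks are kept at the place they have affinity for, and idle workers balance load by randomized work stealing within that place. The abstract does not give the algorithm itself, so the following is only a toy sketch of that general idea and not the authors' method. The function `simulate`, its parameters, the round-based loop, and the choice to queue all tasks on worker 0 are assumptions made for illustration; task identifiers are assumed to be unique.

```python
import random
from collections import deque


def simulate(tasks, num_places, workers_per_place, seed=0):
    """Toy model: tasks is a list of (task_id, affine_place) pairs.

    Returns {task_id: (place, worker)} recording where each task ran.
    """
    rng = random.Random(seed)
    # One deque per worker, grouped by place.
    deques = [[deque() for _ in range(workers_per_place)]
              for _ in range(num_places)]

    # Affinity step: every task is queued at its affine place and never
    # leaves it. Queuing on worker 0 is an arbitrary (hypothetical) choice
    # that creates the imbalance the stealing step then has to repair.
    for task_id, place in tasks:
        deques[place][0].append(task_id)

    ran_on = {}
    while len(ran_on) < len(tasks):
        for place in range(num_places):
            for worker in range(workers_per_place):
                own = deques[place][worker]
                if own:
                    task_id = own.pop()  # owner takes its newest task
                else:
                    # Idle worker: pick a random victim in the SAME place.
                    victim = rng.randrange(workers_per_place)
                    if victim == worker or not deques[place][victim]:
                        continue  # failed steal attempt this round
                    # Thief takes the victim's oldest task.
                    task_id = deques[place][victim].popleft()
                ran_on[task_id] = (place, worker)
    return ran_on


if __name__ == "__main__":
    demo_tasks = [(i, i % 2) for i in range(8)]
    for task_id, (place, worker) in sorted(simulate(demo_tasks, 2, 2).items()):
        print(f"task {task_id} ran at place {place} on worker {worker}")
```

Because steals never cross place boundaries, every task in this sketch runs at its affine place regardless of which worker executes it. The sketch ignores task spawning, inter-place messages, and the space bounds that the paper analyzes.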

Bibliographic record

Source: Operating Systems Review, 2011, No. 2, pp. 14-27 (14 pages)
Authors: Ankur Narang; Abhinav Srivastava; Naga Praveen Kumar Katta; Rudrapatna K. Shyamasundar
Author affiliations:
IBM Research - India, New Delhi;
IBM Research - India, New Delhi;
IBM Research - India, New Delhi;
Tata Institute of Fundamental Research, Mumbai
Original format: PDF
Language of text: English
