Towards Communication Profile, Topology and Node Failure Aware Process Placement

机译：面向通信配置文件，拓扑和节点故障感知过程的放置

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

HPC systems need to keep growing in size to meet the ever-increasing demand for high levels of capability and capacity, often in tight time windows for urgent computation. However, increasing the size, complexity and heterogeneity of HPC systems also increases the risk and impact of system failures, that result in resource waste and aborted jobs. A major contributor to job completion time is the cost of interprocess communication. To address performance and energy efficiency, several prior studies have targeted improvements of communication locality. To meet this goal, they derive a mapping of MPI processes to system nodes in a way that reduces communication cost. However, such approaches disregard the effect of system failures. In this work, we propose a resource allocation approach for MPI jobs, considering both high performance and error resilience. Our approach, named Communication Profile, Topology and node Failure (CPTF), takes into account the application's communication profile, system topology and node failure probability for assigning job processes to nodes. We evaluate variants of CPTF through simulations of two MPI applications, one with a regular communication pattern (LAMMPS) and one with an irregular one (NPB-DT). In both cases, the variant of CPTF that strives to avoid failure-prone nodes and communication paths achieves lower time to complete job batches when compared to the default resource allocation policy of Slurm. It also exhibits the lowest ratio of aborted jobs. The average improvement in batch completion time is 67% for NPB-DT and 34% for LAMMPS.

机译：HPC系统需要保持不断增长的规模，以满足对高水平能力和容量不断增长的需求，通常需要在紧迫的时间范围内进行紧急计算。但是，增加HPC系统的大小，复杂性和异构性也会增加系统故障的风险和影响，从而导致资源浪费和作业中止。作业完成时间的一个主要因素是进程间通信的成本。为了解决性能和能源效率问题，一些现有研究的目标是改善通信位置。为了实现此目标，他们以降低通信成本的方式派生了MPI进程到系统节点的映射。但是，这种方法忽略了系统故障的影响。在这项工作中，我们考虑到高性能和错误恢复能力，提出了一种用于MPI作业的资源分配方法。我们的方法称为通信配置文件，拓扑和节点故障（CPTF），它考虑了应用程序的通信配置文件，系统拓扑结构和为节点分配作业过程的节点故障概率。我们通过仿真两种MPI应用程序来评估CPTF的变体，一种具有常规的通信模式（LAMMPS），另一种具有不规则的通信模式（NPB-DT）。在这两种情况下，与Slurm的默认资源分配策略相比，CPTF的变体都在努力避免容易出现故障的节点和通信路径，从而缩短了完成作业批处理的时间。它也表现出最低的中止工作率。对于NPB-DT，批处理完成时间的平均改善为67％，对于LAMMPS，则为34％。

著录项

来源
《IEEE International Symposium on Computer Architecture and High Performance Computing》|2020年|241-248|共8页
会议地点
作者
Ioannis Vardas; Manolis Ploumidis; Manolis Marazakis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Failure aware resource allocation; Resilience; MPI parallel jobs;

机译：故障感知资源分配;弹性; MPI并行作业;

相似文献

外文文献
中文文献
专利

1. Energy-aware node placement, topology control and MAC scheduling for wireless sensor networks [J] . Chih-Yung Chang, Hsu-Ruey Chang Computer networks . 2008,第11期

机译：无线传感器网络的能量感知节点放置，拓扑控制和MAC调度
2. To 4,000 Compute Nodes and Beyond: Network-aware Vertex Placement in Large-scale Graph Processing Systems [J] . Karim Awara, Hani Jamjoom, Panos Kalnis Computer communication review . 2013,第4期

机译：到4,000个计算节点及以后：大型图形处理系统中可感知网络的顶点
3. Robust node‐to‐node consensus of linear multiagent systems with directed switching topologies subject to uncertain pinning communications [J] . Wang Peijun, Yu Wenwu, Yu Xinghuo International Journal of Robust and Nonlinear Control . 2018,第5期

机译：具有指导切换拓扑的线性多轴系统的强大节点与节点共识，以便不确定循环通信
4. Communications-Aware Process Placement Taking into Account Symmetries of Topology [C] . ilinskas Julius International Conference on P2P, Parallel, Grid, Cloud and Internet Computing . 2013

机译：考虑拓扑对称性的通信感知过程放置
5. Accelerating MPI collective communications through hierarchical algorithms with flexible inter-node communication and imbalance awareness. [D] . Parsons, Benjamin S. 2015

机译：通过具有灵活的节点间通信和不平衡意识的分层算法来加速MPI集体通信。
6. WISC-IV Profile in High-Functioning Autism Spectrum Disorders: Impaired Processing Speed is Associated with Increased Autism Communication Symptoms and Decreased Adaptive Communication Abilities [O] . Rafael E. Oliveras-Rentas, Lauren Kenworthy, Richard B. Roberson III, -1

机译：WISC-IV在高功能性谱系谱中的配置文件：加工速度受损与增加的自闭症通信症状和减少的适应性通信能力有关
7. Locality and Topology aware Intra-node Communication Among Multicore CPUs [O] . Teng Ma, George Bosilca, Aurelien Bouteiller, 2010

机译：多核CpU之间的位置和拓扑感知节点内通信

Towards Communication Profile, Topology and Node Failure Aware Process Placement

摘要

著录项

相似文献

相关主题

期刊订阅