A Multi-faceted Approach to Job Placement for Improved Performance on Extreme-Scale Systems


Abstract

Job placement plays a pivotal role in application performance on supercomputers. We present a multi-faceted exploration of ways to influence placement in extreme-scale systems in order to improve network performance and decrease variability. In our first exploration, Scores, we developed a machine learning model that extracts features from a job's node allocation and grades performance. This model identified several important node metrics, which led to Dual-Ended scheduling, a means of reducing network contention without impacting utilization. In evaluations on the Titan supercomputer, we observed reductions in average hop count of up to 50%. We also developed an improved node-layout strategy that targets a better balance between network latency and bandwidth; replacing the default ALPS layout on Titan with this strategy resulted in an average runtime improvement of 10%. Both efforts underscore the importance of a job placement strategy that is cognizant of workload mixture and network topology.
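To make the hop-count metric mentioned in the abstract concrete, the following is a minimal sketch of how an average pairwise hop count could be computed for a job's node allocation. It is not the authors' implementation: it assumes a 3D torus interconnect with wraparound links (the abstract does not describe the topology), and the torus dimensions, node coordinates, and the function names `torus_hops` and `average_hop_count` are all hypothetical.

```python
from itertools import combinations

def torus_hops(a, b, dims):
    """Minimal hop count between two nodes on a torus with wraparound links."""
    # Per dimension, take the shorter of the direct and the wraparound route.
    return sum(min(abs(x - y), d - abs(x - y)) for x, y, d in zip(a, b, dims))

def average_hop_count(nodes, dims):
    """Mean hop count over all unordered pairs of nodes in an allocation."""
    pairs = list(combinations(nodes, 2))
    if not pairs:
        return 0.0
    return sum(torus_hops(a, b, dims) for a, b in pairs) / len(pairs)

# Hypothetical 8x8x8 torus and two hypothetical 4-node allocations.
DIMS = (8, 8, 8)
compact = [(x, y, 0) for x in range(2) for y in range(2)]
scattered = [(0, 0, 0), (4, 4, 4), (0, 4, 0), (4, 0, 4)]

if __name__ == "__main__":
    print("compact  :", average_hop_count(compact, DIMS))
    print("scattered:", average_hop_count(scattered, DIMS))
```

Under these assumptions, the compact allocation yields a much lower average hop count than the scattered one, which is the kind of difference a placement-aware scheduler would try to exploit.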


Bibliographic details

Source
International Conference for High Performance Computing, Networking, Storage and Analysis | 2016 | pp. 1015-1025 | 11 pages
Authors
Christopher Zimmer; Saurabh Gupta; Scott Atchley; Sudharshan S. Vazhkudai; Carl Albing;
Original format: PDF
Keywords
Resource management; Bandwidth; Benchmark testing; Three-dimensional displays; Layout; Processor scheduling; Visualization;


Similar literature

1. The application of a multi-faceted approach for evaluating and improving the life cycle environmental performance of service industries [J]. Scott O. Shrake, Melissa M. Bilec, Amy E. Landis. Journal of Cleaner Production. 2013, issue mara
2. Improved Performance of Agent Based Placement Cell System - A Performance Efficient Role Clustering Technique [J]. Soumya Suravita, Prabhat Ranjan, R. K. Singh. WSEAS Transactions on Computers. 2006, issue 10
3. Impact of relay placement in three-hop buffer-aided FSO systems: An approximate performance analysis approach [J]. El-Rajab Mirna, Abou-Rjeily Chadi. Physical Communication. 2021, issue Apra
4. A Multi-Faceted Approach to Job Placement for Improved Performance on Extreme-Scale Systems [C]. Christopher Zimmer, Saurabh Gupta, Scott Atchley. International Conference for High Performance Computing, Networking, Storage and Analysis. 2016
5. Topology-Aware Job Scheduling and Placement in High Performance Computing and Edge Computing Systems [D]. Li, Kangkang. 2019
6. Systems approach to assessing and improving local human research Institutional Review Board performance [O]. John Fontanesi, Anthony Magit, Jennifer J. Ford. 2018
7. A practical approach to reconciling availability, performance, and capacity in provisioning extreme-scale storage systems [O]. Lipeng Wan, Feiyi Wang, Sarp Oral. 2015
