Crossing the architectural barrier: Evaluating representative regions of parallel HPC applications

机译：跨越架构障碍：评估并行HPC应用程序的代表性区域

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Exascale computing will get mankind closer to solving important social, scientific and engineering problems. Due to high prototyping costs, High Performance Computing (HPC) system architects make use of simulation models for design space exploration and hardware-software co-design. However, as HPC systems reach exascale proportions, the cost of simulation increases, since simulators themselves are largely single-threaded. Tools for selecting representative parts of parallel applications to reduce running costs are widespread, e.g., BarrierPoint achieves this by analysing, in simulation, abstract characteristics such as basic blocks and reuse distances. However, architectures new to HPC have a limited set of tools available. In this work, we provide an independent cross-architectural evaluation on real hardware - across Intel and ARM - of the BarrierPoint methodology, when applied to parallel HPC proxy applications. We present both cases: when the methodology can be applied and when it cannot. In the former case, results show that we can predict the performance of full application execution by running shorter representative sections. In the latter case, we dive into the underlying issues and suggest improvements. We demonstrate a total simulation time reduction of up to 178x, whilst keeping the error below 2.3% for both cycles and instructions.

机译：万亿级计算将使人类更接近解决重要的社会，科学和工程问题。由于高昂的原型设计成本，高性能计算（HPC）系统架构师将仿真模型用于设计空间探索和软硬件协同设计。但是，随着HPC系统达到百亿亿美元级的规模，由于仿真器本身主要是单线程的，因此仿真的成本增加了。选择并行应用程序代表性部分以降低运行成本的工具非常广泛，例如，BarrierPoint通过在仿真中分析抽象特征（例如基本块和重用距离）来实现这一目标。但是，HPC的新体系结构只能使用有限的一组工具。在这项工作中，当将BarrierPoint方法应用于并行HPC代理应用程序时，我们将对跨Intel和ARM的真实硬件进行独立的跨体系结构评估。我们介绍两种情况：什么时候可以应用方法论，什么时候不能应用。在前一种情况下，结果表明我们可以通过运行较短的代表部分来预测完整应用程序执行的性能。在后一种情况下，我们将深入研究潜在问题并提出改进建议。我们证明了整个仿真时间最多可减少178倍，同时使周期和指令的误差均低于2.3％。

著录项

来源
《2017 IEEE International Symposium on Performance Analysis of Systems and Software》|2017年|109-120|共12页
会议地点 Santa Rosa(US)
作者
Alexandra Ferreóon; Radhika Jagtap; Sascha Bischoff; Roxana Ruşitoru;
展开▼
作者单位

Universidad de Zaragoza, Spain;

ARM Ltd., U.K.;

ARM Ltd., U.K.;

ARM Ltd., U.K.;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Hardware; Analytical models; Computational modeling; Computer architecture; Registers; Monte Carlo methods; Tools;

机译：硬件;分析模型;计算模型;计算机体系结构;寄存器; Monte Carlo方法;工具;

相似文献

外文文献
中文文献
专利

1. ENPASSANT: AN ENVIRONMENT FOR EVALUATING MASSIVELY PARALLEL ARRAY ARCHITECTURES FOR SPATIALLY MAPPED APPLICATIONS [J] . MARTIN C. HERBORDT, CHARLES C. WEEMS International Journal of Pattern Recognition and Artificial Intelligence . 1995,第2期

机译：附件：用于评估空间映射应用程序的大规模并行阵列体系结构的环境
2. Evaluating parallel architectures for two real-time applications with 100 kHz repetition rate (hadron collider data) [J] . Badier J., Bock R.K. IEEE Transactions on Nuclear Science . 1993,第1期

机译：评估具有100 kHz重复频率的两个实时应用的并行架构（强子对撞机数据）
3. Decadal application of WRF/Chem for regional air quality and climate modeling over the US under the representative concentration pathways scenarios. Part 1: Model evaluation and impact of downscaling [J] . Yahya Khairunnisa, Wang Kai, Campbell Patrick, Atmospheric environment . 2017,第mara期

机译：WRF / Chem在代表浓度路径情景下在美国区域空气质量和气候模拟中的十年应用。第1部分：模型评估和缩小规模的影响
4. Crossing the architectural barrier: Evaluating representative regions of parallel HPC applications [C] . Alexandra Ferreóon, Radhika Jagtap, Sascha Bischoff, IEEE International Symposium on Performance Analysis of Systems and Software . 2017

机译：穿过建筑障碍：评估并行HPC应用的代表区域
5. A journey through performance evaluation, tuning, and analysis of parallelized applications and parallel architectures: Quantitative approach. [D] . Mustafa, Dheya G. 2013

机译：并行应用程序和并行体系结构的性能评估，调整和分析的过程：定量方法。
6. Towards a HPC-oriented parallel implementation of a learning algorithm for bioinformatics applications [O] . Gianni DAngelo, Salvatore Rampone 2014

机译：面向面向HPC的生物信息学应用学习算法的并行实现
7. Evaluation of Paralleled Generation Architectures for Civil Aircraft Applications [O] . Theodoros Kostakis, Patrick Norman, Steven Fletcher, 2016

机译：民用飞机应用的并行发电架构评估

Crossing the architectural barrier: Evaluating representative regions of parallel HPC applications

摘要

著录项

相似文献

相关主题

期刊订阅