首页> 外文会议>IEEE International Conference on e-Science >Simulating HEP Workflows on Heterogeneous Architectures
【24h】

Simulating HEP Workflows on Heterogeneous Architectures

机译:在异构架构上模拟HEP工作流程

获取原文

摘要

The next generation of supercomputing facilities, such as Oak Ridge's Summit and Lawrence Livermore's Sierra, show an increasing use of GPGPUs and other accelerators in order to achieve their high FLOP counts. This trend will only grow with exascale facilities. In general, High Energy Physics computing workflows have made little use of GPUs due to the relatively small fraction of kernels that run efficiently on GPUs, and the expense of rewriting code for rapidly evolving GPU hardware. However, the computing requirements for high-luminosity LHC are enormous, and it will become essential to be able to make use of supercomputing facilities that rely heavily on GPUs and other accelerator technologies. ATLAS has already developed an extension to AthenaMT, its multithreaded event processing framework, that enables the non-intrusive offloading of computations to external accelerator resources, and is developing strategies to schedule the offloading efficiently. Before investing heavily in writing many kernels, we need to better understand the performance metrics and throughput bounds of the workflows with various accelerator configurations. This can be done by simulating the workflows, using real metrics for task interdependencies and timing, as we vary fractions of offloaded tasks, latencies, data conversion speeds, memory bandwidths, and accelerator offloading parameters such as CPU/GPU ratios and speeds. We present the results of these studies, which will be instrumental in directing effort to make the ATLAS framework, kernels and workflows run efficiently on exascale facilities.
机译:下一代超级计算设施,例如橡树岭的峰会和劳伦斯利弗尔的塞拉,越来越多地利用GPGPU和其他加速器,以实现其高拖鞋计数。这一趋势只会与ExaScale设施一起增长。通常,高能量物理计算工作流程几乎没有使用GPU由于GPU上有效运行的相对较小的核,以及用于快速发展GPU硬件的重写代码的费用。然而,高亮度LHC的计算要求是巨大的,并且能够利用超级计算设施依赖GPU和其他加速技术的超级计算设施成为必要的。 Atlas已经为Athenamt,其多线程事件处理框架开发了一个扩展,使得计算计算到外部加速器资源,并且正在开发有效调度卸载的策略。在投资庞大的写作许多内核之前,我们需要更好地了解各种加速器配置的工作流的性能指标和吞吐量界限。这可以通过模拟工作流程,使用实际指标来完成任务相互依赖性和时序,因为我们改变了卸载任务,延迟,数据转换速度,内存带宽和加速器卸载参数,例如CPU / GPU比率和速度。我们介绍了这些研究的结果,这将有助于指导努力使地图集框架,内核和工作流程有效地在ExaScale设施上运行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号