首页> 外文OA文献 >Extreme-scaling applications en route to exascale
【2h】

Extreme-scaling applications en route to exascale

机译:极端扩展的应用程序将向万亿级规模发展

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Feedback from the previous year's very successful workshop motivated the organisation of a three-day workshop from 1 to 3 February 2016, during which the 28-rack JUQUEEN BlueGene/Q system with 458 752 cores was reserved for over 50 hours. Eight international code teams were selected to use this opportunity to investigate and improve their application scalability, assisted by staff from JSC Simulation Laboratories and Cross-Sectional Teams. Ultimately seven teams had codes successfully run on the full JUQUEEN system. Strong scalability demonstrated by Code Saturne and Seven-League Hydro, both using 4 OpenMP threads for 16 MPI processes on each compute node for a total of 1 835 008 threads, qualify them for High-Q Club membership. Existing members CIAO and iFETI were able to show that they had additional solvers which also scaled acceptably. Furthermore, large-scale in-situ interactive visualisation was demonstrated with a CIAO simulation using 458 752 MPI processes running on 28 racks coupled via JUSITU to VisIt. The two adaptive mesh refinement utilities, ICI and p4est, showed that they could respectively scale to run with 458 752 and 971 504 MPI ranks, but both encountered problems loading large meshes. Parallel file I/O issues also hindered large-scale executions of PFLOTRAN. Poor performance of a NEST-import module which loaded and connected 1.9 TiB of neuron and synapse data was tracked down to an internal data-structure mismatch with the HDF5 file objects that prevented use of MPI collective file reading, which when rectified is expected to enable large-scale neuronal network simulations.Comparative analysis is provided to the 25 codes in the High-Q Club at the start of 2016, which includes five codes that qualified from the previous workshop. Despite more mixed results, we learnt more about application file I/O limitations and inefficiencies which continue to be the primary inhibitor to large-scale simulations.
机译:上一年非常成功的研讨会的反馈促使组织了一次为期三天的研讨会,该研讨会于2016年2月1日至3日举行,期间将28机架的JUQUEEN BlueGene / Q系统和458 752个内核预留了50多个小时。在JSC模拟实验室和跨部门团队的协助下,选择了八个国际代码团队来利用此机会来调查和改善其应用程序可伸缩性。最终,七个团队的代码成功在完整的JUQUEEN系统上运行。 Code Saturne和Seven-League Hydro展示了强大的可伸缩性,它们都在每个计算节点上使用16个MPI进程的4个OpenMP线程,总共1 835 008个线程,使它们符合High-Q Club成员资格。现有成员CIAO和iFETI能够证明他们还有更多的求解器,它们的缩放比例也可以接受。此外,还通过CIAO模拟对大规模现场互动可视化进行了演示,该模拟使用458752 MPI流程运行在通过JUSITU耦合到VisIt的28个机架上。两种自适应网格细化实用程序ICI和p4est表明它们可以分别缩放以使用458 752和971 504 MPI等级运行,但是在加载大型网格时都遇到了问题。并行文件I / O问题也阻碍了PFLOTRAN的大规模执行。 NEST导入模块(加载并连接了1.9 TiB的神经元和突触数据)的性能较差,其原因是内部数据结构与HDF5文件对象不匹配,从而导致无法使用MPI集合文件读取,并且有望在纠正后启用该功能。大型神经网络仿真.2016年初,对High-Q Club的25条代码进行了比较分析,其中包括上次研讨会获得资格的5条代码。尽管结果复杂得多,但我们了解了更多有关应用程序文件I / O限制和效率低下的信息,这些仍然是大规模仿真的主要障碍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号