
Development of element-by-element kernel algorithms in unstructured finite-element solvers for many-core wide-SIMD CPUs: Application to earthquake simulation


Abstract

Acceleration of the element-by-element (EBE) kernel in matrix-vector products is essential for high performance in unstructured implicit finite-element applications. However, attaining high performance with the EBE kernel is not straightforward because of random data access with data recurrence. In this paper, we develop methods to circumvent the resulting data races for high performance on many-core CPU architectures with wide SIMD units. The developed EBE kernel attains 16.3% and 16.0% of FP32 peak on the Intel Xeon Phi (Knights Landing) based Oakforest-PACS and the Intel Xeon Platinum (Cascade Lake) based Oakbridge-CX, respectively. This leads to a 2.88-fold speedup over the baseline kernel and a 2.03-fold speedup of the whole finite-element application on Oakforest-PACS. Examples of finite-element earthquake simulations using the developed EBE kernel algorithms are shown. These insights are expected to enable high performance in other unstructured finite-element solvers on large-scale systems based on many-core wide-SIMD CPUs. © 2020 Elsevier B.V. All rights reserved.
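
To make the terms in the abstract concrete, here is a minimal sketch of a generic EBE matrix-vector product, in which the global matrix is never assembled and each element gathers its nodal values and scatter-adds its contribution. It is not the paper's kernel. The function names `ebe_matvec` and `color_elements` are hypothetical, and greedy element coloring is used here only as one well-known way to avoid write conflicts; the abstract does not say which methods the authors developed.

```python
def ebe_matvec(elements, element_matrices, u, n_nodes):
    """Compute f = K u element by element, without assembling K."""
    f = [0.0] * n_nodes
    for nodes, ke in zip(elements, element_matrices):
        # Gather: indirect (random) reads of the nodal values of this element.
        ue = [u[n] for n in nodes]
        for a, na in enumerate(nodes):
            # Scatter-add: elements sharing node `na` update the same entry.
            # Run in parallel or in SIMD lanes, this is the data recurrence
            # that causes races.
            f[na] += sum(ke[a][b] * ue[b] for b in range(len(nodes)))
    return f


def color_elements(elements):
    """Greedy coloring: elements that share a node get different colors,
    so all elements of one color can be processed without write conflicts."""
    colors_at_node = {}  # node index -> colors of elements touching it
    colors = []
    for nodes in elements:
        used = set()
        for n in nodes:
            used |= colors_at_node.get(n, set())
        c = 0
        while c in used:
            c += 1
        colors.append(c)
        for n in nodes:
            colors_at_node.setdefault(n, set()).add(c)
    return colors


# Illustrative data (an assumption, not from the paper): a 1D bar with
# four two-node elements and unit stiffness.
elements = [(0, 1), (1, 2), (2, 3), (3, 4)]
ke = [[1.0, -1.0], [-1.0, 1.0]]
u = [0.0, 1.0, 4.0, 9.0, 16.0]
print(ebe_matvec(elements, [ke] * len(elements), u, n_nodes=5))
print(color_elements(elements))
```

In this toy mesh, neighbouring elements share a node, so the coloring alternates between two groups. Within each group the scatter-add touches disjoint entries of `f`, which is the property a parallel or vectorized kernel would need.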

Bibliographic record

  • Source
    Journal of computational science | 2020, Issue 9 | 101174.1-101174.11 | 11 pages
  • Author affiliations

    Univ Tokyo Earthquake Res Inst 1-1-1 Yayoi Bunkyo Tokyo 1130032 Japan|Univ Tokyo Dept Civil Engn 1-1-1 Yayoi Bunkyo Tokyo 1130032 Japan|RIKEN Ctr Computat Sci Wako Saitama Japan;

    Intel Corp Graph & Software Grp Intel Architecture Tokyo Japan;

    Univ Tokyo Earthquake Res Inst 1-1-1 Yayoi Bunkyo Tokyo 1130032 Japan|Univ Tokyo Dept Civil Engn 1-1-1 Yayoi Bunkyo Tokyo 1130032 Japan|RIKEN Ctr Computat Sci Wako Saitama Japan|RIKEN Ctr Adv Intelligence Project Wako Saitama Japan;

    Intel Corp Data Ctr Grp Bend OR USA;

    RIKEN Ctr Computat Sci Wako Saitama Japan|Univ Tokyo Informat Technol Ctr Tokyo Japan;

    Japan Agcy Marine Earth Sci & Technol Res Inst Value Added Informat Generat Yokosuka Kanagawa Japan;

    Univ Tokyo Earthquake Res Inst 1-1-1 Yayoi Bunkyo Tokyo 1130032 Japan|Univ Tokyo Dept Civil Engn 1-1-1 Yayoi Bunkyo Tokyo 1130032 Japan|RIKEN Ctr Computat Sci Wako Saitama Japan;

  • Indexing information
  • Original format: PDF
  • Language of main text: English
  • Chinese Library Classification
  • Keywords

    Finite-element method; Random data access; Many-core; SIMD; Time-parallel algorithm;

  • Date added: 2022-08-18 21:31:47
