首页> 外文会议>Asian Conference on Supercomputing Frontiers >Performance Evaluation and Analysis of Linear Algebra Kernels in the Prototype Tianhe-3 Cluster
【24h】

Performance Evaluation and Analysis of Linear Algebra Kernels in the Prototype Tianhe-3 Cluster

机译:原型天河3集群线性代数内核的性能评估与分析

获取原文
获取外文期刊封面目录资料

摘要

As the supercomputing system entering the exascale era, power consumption becomes a major concern in the system design. Among all the novel techniques for reducing power consumption, ARM architecture is gaining popularity in the HPC community due to its low power footprint and high energy efficiency. As one of the initiatives for addressing the exascale challenges in China, Tianhe-3 supercomputer has adopted the technology roadmap of using the many-core ARM architecture with home-built phytium-2000+ and matrix-2000+ processors. In this paper, we evaluate several linear algebra kernels such as matrix-matrix multiplication, matrix-vector multiplication and triangular solver with both sparse and dense datasets. These linear algebra kernels are good performance indicators of the prototype Tianhe-3 cluster. Comprehensive analysis is performed using roofline model to identify the directions for performance optimization from both hardware and software perspectives. In addition, we compare the performance of phytium-2000+ and matrix-2000+ with widely used KNL processor. We believe this paper provides valuable experiences and insights as work-in-progress towards exascale for the HPC community.
机译:作为进入ExaScale时代的超级计算系统,功耗成为系统设计中的主要问题。在降低功耗的所有新颖技术中,由于其低功耗占地面积和高能量效率,ARM架构在HPC社区中受到普及。作为解决中国外国人挑战的倡议之一,天河3超级计算机采用了使用家庭式植物 - 2000 +和矩阵-2000 +处理器的多芯臂架构的技术路线图。在本文中,我们评估了诸如矩阵矩阵乘法,矩阵 - 向量乘法和三角形求解器等若干线性代数核,其稀疏和密集的数据集。这些线性代数内核是原型天河3集群的良好性能指标。使用屋顶模型进行综合分析,以识别硬件和软件透视图中性能优化的方向。此外,我们将植物-2000 +和矩阵-2000 +与广泛使用的KNL处理器进行比较。我们认为本文提供了有价值的经验和洞察力为惠普社区的exascale的过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号