首页> 外文期刊>Scientific programming >HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi
【24h】

HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi

机译:在具有MAGMA端口的至强融核的英特尔多核内核硬件上进行HPC编程

获取原文
获取原文并翻译 | 示例

摘要

This paper presents the design and implementation of several fundamental dense linear algebra (DLA) algorithms for multicore with Intel Xeon Phi coprocessors. In particular, we consider algorithms for solving linear systems. Further, we give an overview of the MAGMA MIC library, an open source, high performance library, that incorporates the developments presented here and, more broadly, provides the DLA functionality equivalent to that of the popular LAPACK library while targeting heterogeneous architectures that feature a mix of multicore CPUs and coprocessors. The LAPACK-compliance simplifies the use of the MAGMA MIC library in applications, while providing them with portably performant DLA. High performance is obtained through the use of the high-performance BLAS, hardware-specific tuning, and a hybridization methodology whereby we split the algorithm into computational tasks of various granularities. Execution of those tasks is properly scheduled over the heterogeneous hardware by minimizing data movements and mapping algorithmic requirements to the architectural strengths of the various heterogeneous hardware components. Our methodology and programming techniques are incorporated into the MAGMA MIC API, which abstracts the application developer from the specifics of the Xeon Phi architecture and is therefore applicable to algorithms beyond the scope of DLA.
机译:本文介绍了使用英特尔至强融核协处理器的几种用于多核的基本密集线性代数(DLA)算法的设计和实现。特别地,我们考虑用于求解线性系统的算法。此外,我们概述了MAGMA MIC库,这是一个开放源代码的高性能库,它结合了此处介绍的开发,并且更广泛地提供了与流行的LAPACK库等效的DLA功能,同时针对具有以下特点的异构体系结构:多核CPU和协处理器的混合。符合LAPACK的特性简化了MAGMA MIC库在应用程序中的使用,同时为它们提供了可移植的高性能DLA。通过使用高性能BLAS,特定于硬件的调优和混合方法,可以获得高性能,从而将算法分为各种粒度的计算任务。通过最小化数据移动并将算法要求映射到各种异构硬件组件的体系结构优势,可以在异构硬件上适当地安排这些任务的执行。我们的方法和编程技术已整合到MAGMA MIC API中,从而使应用程序开发人员从至强融核架构的细节中抽象出来,因此适用于DLA范围以外的算法。

著录项

  • 来源
    《Scientific programming》 |2015年第2015期|502593.1-502593.11|共11页
  • 作者单位

    Univ Tennessee, Knoxville, TN 37996 USA|Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA|Univ Manchester, Manchester M13 9PL, Lancs, England;

    Univ Tennessee, Knoxville, TN 37996 USA;

    Univ Tennessee, Knoxville, TN 37996 USA;

    Univ Tennessee, Knoxville, TN 37996 USA;

    Univ Tennessee, Knoxville, TN 37996 USA;

    Univ Tennessee, Knoxville, TN 37996 USA;

    Univ Tennessee, Knoxville, TN 37996 USA;

  • 收录信息 美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号