Efficient parallel implementation of the ChASE library on distributed CPU-GPU architectures

Abstract

The Chebyshev Accelerated Subspace iteration Eigensolver (ChASE) is an iterative eigensolver developed at the JSC by the SimLab ab initio. The solver principally targets sequences of dense eigenvalue problems as they arise in Density Functional Theory, but it can also be applied to a single eigenproblem. ChASE relies predominantly on BLAS 3 subroutines to achieve close-to-peak performance. Currently, the library can be executed in parallel on many- and multi-core platforms. The latest development in this project extends the CUDA build to encompass multiple GPUs attached to distinct CPUs. This hybrid parallelization uses both MPI and CUDA interfaces to exploit heterogeneous multi-GPU platforms effectively. The extended library was tested on large, dense eigenproblems extracted from excitonic Hamiltonians. The ultimate goal is to integrate this new parallel implementation of ChASE with the VASP-BSE code.
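
To illustrate why a Chebyshev-accelerated subspace iteration is dominated by BLAS 3 work, the following is a minimal sketch of a Chebyshev polynomial filter applied to a block of vectors. It is not ChASE's actual code or API: the function names `matmul` and `chebyshev_filter`, the plain-Python matrix representation, and the toy diagonal matrix are all assumptions made for illustration. The sketch assumes a symmetric matrix whose unwanted eigenvalues lie in a known interval [a, b] and whose wanted eigenvalues lie outside it.

```python
def matmul(A, B):
    """Dense product A @ B on lists of rows (stand-in for a BLAS 3 GEMM call)."""
    cols = list(zip(*B))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in A]


def chebyshev_filter(A, X, degree, a, b):
    """Return T_degree((A - c*I) / e) applied to the block X.

    [a, b] is the interval of unwanted eigenvalues, c its center and e its
    half-width. Components along eigenvalues inside [a, b] stay bounded by 1
    in magnitude, while components outside the interval are amplified.
    """
    c, e = (a + b) / 2.0, (b - a) / 2.0

    def shifted_scaled(Y):
        # (A - c*I) Y / e: one matrix-matrix product plus elementwise updates.
        AY = matmul(A, Y)
        return [[(ay - c * y) / e for ay, y in zip(row_ay, row_y)]
                for row_ay, row_y in zip(AY, Y)]

    if degree == 0:
        return [row[:] for row in X]
    prev, curr = X, shifted_scaled(X)
    for _ in range(degree - 1):
        # Three-term recurrence: T_{k+1}(t) = 2 t T_k(t) - T_{k-1}(t).
        step = shifted_scaled(curr)
        nxt = [[2.0 * s - p for s, p in zip(row_s, row_p)]
               for row_s, row_p in zip(step, prev)]
        prev, curr = curr, nxt
    return curr


# Toy example (hypothetical data): diagonal matrix with eigenvalues 0, 1, ..., 5.
n = 6
A = [[float(i) if i == j else 0.0 for j in range(n)] for i in range(n)]
X = [[1.0] for _ in range(n)]  # one start vector with equal weight everywhere

# Damp the interval [1, 5]; the eigenvalue 0 lies outside it and is amplified.
Y = chebyshev_filter(A, X, degree=4, a=1.0, b=5.0)
```

In this toy case the entry of `Y` belonging to eigenvalue 0 grows with the filter degree, while the entries belonging to eigenvalues in [1, 5] remain at most 1 in magnitude. Each filter step costs one matrix-matrix product, which is the kind of operation a tuned GEMM routine on CPUs or GPUs would perform in a production setting.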

Bibliographic record

  • Author

    Di Napoli Edoardo;

  • Author affiliation
  • Year 2016
  • Total pages
  • Original format PDF
  • Text language eng
  • Chinese Library Classification
