首页> 外文期刊>Journal of chemical theory and computation: JCTC >Full Parallel Implementation of an All-Electron Four-Component Dirac-Kohn-Sham Program
【24h】

Full Parallel Implementation of an All-Electron Four-Component Dirac-Kohn-Sham Program

机译:全电子四分量Dirac-Kohn-Sham程序的完全并行实现

获取原文
获取原文并翻译 | 示例
           

摘要

A full distfibuted-memory implementation of the Dirac—Kohn—Sham (DKS) module of the program BERTHA (Belpassi et al., Phys. Chem. Chem. Phys. 2011, 13, 12368—12394) is presented, where the self-consistent field (SCF) procedure is replicated on all the parallel processes, each process working on subsets of the global matrices. The key feature of the implementation is ah efficient procedure for switching between two matrix distribution schemes, one (integral-driven) optimal for the parallel computation of the matrix elements and another (block-cyclic) optimal for the parallel linear algebra operations. This approach, making both CPU-time and memory scalable with the number of processors used, virtually overcomes at once both time and memory barriers associated with DKS calculations. Performance, portability, and numerical stability of the code are illustrated on the basis of test calculations on three gold clusters of increasing size, an organometallic compound, and a perovskite model. The calculations are performed on a Beowulf and a BlueGene/Q, system.
机译:介绍了BERTHA程序的Dirac-Kohn-Sham(DKS)模块的完整分布式内存实施方式(Belpassi等人,Phys。Chem。Chem。Phys。2011,13,12368-12394),其中一致字段(SCF)过程在所有并行过程上复制,每个过程都在全局矩阵的子集上工作。该实现的关键特征是一种高效的过程,可在两种矩阵分配方案之间进行切换,一种方案(积分驱动)最优化用于矩阵元素的并行计算,另一种方案(块循环)最优用于并行线性代数运算。这种方法可以使CPU时间和内存随所使用的处理器数量而扩展,几乎可以同时克服与DKS计算相关的时间和内存障碍。在对三个尺寸不断增大的金簇,有机金属化合物和钙钛矿模型进行测试计算的基础上,说明了代码的性能,可移植性和数值稳定性。计算是在Beowulf和BlueGene / Q系统上进行的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号