首页> 外文期刊>Journal of chemical theory and computation: JCTC >Optimization of the Coupled Cluster Implementation in NWChem on Petascale Parallel Architectures
【24h】

Optimization of the Coupled Cluster Implementation in NWChem on Petascale Parallel Architectures

机译:基于Petascale并行架构的NWChem中耦合集群实现的优化

获取原文
获取原文并翻译 | 示例
           

摘要

The coupled cluster singles and doubles (CCSD) algorithm in the NWChem software package has been optimized to alleviate the communication bottleneck. This optimization provided a 2-fold to 5-fold speedup in the CCSD iteration time depending on the problem size and available memory, and improved the CCSD scaling to 20 000 nodes of the NCSA Blue Waters supercomputer. On 20 000 XE6 nodes of Blue Waters, a complete conventional CCSD(T) calculation of a system encountering 1042 basis functions and 103 occupied correlated orbitals obtained a performance of 0.32 petaflop/s and took 5 h and 24 min to complete. The reported time and performance included all stages of the calculation from initialization to termination for iterative single and double excitations as well as perturbative triples correction. In perturbative triples alone, the computation sustained a rate of 1.18 petaflop/s. The CCSD and (T) phases took approximately 3/4 and 1/4 of the total time to solution, respectively, showing that CCSD is the most time-consuming part at the large scale. The MP2, CCSD, and CCSD(T) computations in 6-311++G~(**) basis set performed on guanine-cytosine deoxydinucleotide monophosphate probed the conformational energy difference between the A- and B-conformations of single stranded DNA Good agreement between MP2 and coupled cluster methods has been obtained, suggesting the utility of MP2 for conformational analysis in these systems. The study revealed a significant discrepancy between the quantum mechanical and classical force field predictions, suggesting a need to improve the dihedral parameters.
机译:NWChem软件包中的耦合群集单打和双打(CCSD)算法已经过优化,可缓解通信瓶颈。此优化根据问题的大小和可用内存,使CCSD迭代时间加快了2到5倍,并且将CCSD扩展到了NCSA Blue Waters超级计算机的2万个节点。在Blue Waters的20000 XE6节点上,系统的完整常规CCSD(T)计算遇到1042个基函数和103个占用的相关轨道,其性能为0.32 petaflop / s,完成时间为5小时24分钟。报告的时间和性能包括计算的所有阶段,从初始化到迭代单次和双次激发的终止以及微扰三元校正。仅在扰动三元组中,计算就保持了1.18 petaflop / s的速率。 CCSD和(T)阶段分别花费了总时间的3/4和1/4,这表明CCSD在大规模上是最耗时的部分。在鸟嘌呤-胞嘧啶脱氧二核苷酸单磷酸酯的6-311 ++ G〜(**)基集中进行的MP2,CCSD和CCSD(T)计算可探测单链DNA的A和B构象之间的构象能差。已获得MP2和耦合簇方法之间的一致性,这表明MP2在这些系统中进行构象分析的实用性。该研究揭示了量子力学和经典力场预测之间的显着差异,表明需要改善二面体参数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号