...
首页> 外文期刊>The journal of physical chemistry, A. Molecules, spectroscopy, kinetics, environment, & general theory >Massively Parallel Implementation of Explicitly Correlated Coupled-Cluster Singles and Doubles Using TiledArray Framework
【24h】

Massively Parallel Implementation of Explicitly Correlated Coupled-Cluster Singles and Doubles Using TiledArray Framework

机译:使用TiledArray框架大规模并行实现显式相关的耦合集群单打和双打

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

A new distributed-memory massively parallel implementation of standard and explicitly correlated (F12) coupled-cluster singles and doubles (CCSD) with canonical O(N-6) computational complexity is described. The implementation is based on the TiledArray tensor framework. Novel features of the implementation include (a) all data greater than O(N) is distributed in memory and (b) the mixed use of density fitting and integral-driven formulations that optionally allows to avoid storage of tensors with three and four unoccupied indices. Excellent strong scaling is demonstrated on a multicore shared-memory computer, a commodity distributed-memory computer, and a national-scale super-computer. The performance on a shared-memory computer is competitive with the popular CCSD implementations in ORCA and Psi4. Moreover, the CCSD performance on a commodity-size cluster significantly improves on the state-of-the-art package NWChem. The large-scale parallel explicitly correlated coupled-cluster implementation makes routine accurate estimation of the coupled-cluster basis set limit for molecules with 20 or more atoms. Thus, it can provide valuable benchmarks for the merging reduced-scaling coupled-cluster approaches. The new implementation allowed us to revisit the basis set limit for the CCSD contribution to the binding energy of pi-stacked uracil dimer, a challenging paradigm of pi-stacking interactions from the S66 benchmark database. The revised value for the CCSD correlation binding energy obtained with the help of quadruple-zeta CCSD computations, -8.30 +/- 0.02 kcal/mol, is significantly different from the S66 reference value, -8.50 kcal/mol, as well as other CBS limit estimates in the recent literature.
机译:描述了具有标准O(N-6)计算复杂度的标准和显式相关(F12)耦合群集单打和双打(CCSD)的新的分布式内存大规模并行实现。该实现基于TiledArray张量框架。实现的新颖性包括:(a)所有大于O(N)的数据都分布在内存中;(b)密度拟合和积分驱动公式的混合使用,可以选择避免存储具有三个和四个未占用索引的张量。在多核共享内存计算机,商品分布式内存计算机和国家级超级计算机上展示了出色的强大伸缩性。共享内存计算机上的性能与ORCA和Psi4中流行的CCSD实现相比具有竞争力。此外,与最新包装NWChem相比,在商品规模集群上的CCSD性能有了显着提高。大规模并行显式相关的耦合簇实现可对具有20个或更多原子的分子进行常规的耦合簇基础集极限的准确估计。因此,它可以为合并缩减规模的耦合集群方法提供有价值的基准。新的实现方式使我们可以重新考虑CCSD对pi堆叠尿嘧啶二聚体的结合能的贡献的基本极限,这是S66基准数据库中pi堆叠相互作用的具有挑战性的范例。借助四重CCSD计算获得的CCSD相关结合能的修正值-8.30 +/- 0.02 kcal / mol与S66参考值-8.50 kcal / mol以及其他CBS显着不同最近文献中的极限估计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号