首页> 外文期刊>Journal of Parallel and Distributed Computing >Exploiting hierarchy parallelism for molecular dynamics on a petascale heterogeneous system
【24h】

Exploiting hierarchy parallelism for molecular dynamics on a petascale heterogeneous system

机译:在Petascale异构系统上利用层次结构并行性进行分子动力学

获取原文
获取原文并翻译 | 示例

摘要

Heterogeneous systems with nodes containing more than one type of computation units, e.g., central processing units (CPUs) and graphics processing units (CPUs), are becoming popular because of their low cost and high performance. In this paper, we have developed a Three-Level Parallelization Scheme (TLPS) for molecular dynamics (MD) simulation on heterogeneous systems. The scheme exploits multilevel parallelism combining (1) inter-node parallelism using spatial decomposition via message passing, (2) intra-node parallelism using spatial decomposition via dynamically scheduled multi-threading, and (3) intra-chip parallelism using multi-threading and short vector extension in CPUs, and employing multiple CUDA threads in CPUs. By using a hierarchy of parallelism with optimizations such as communication hiding intra-node, and memory optimizations in both CPUs and CPUs, we have implemented and evaluated a MD simulation on a petascale heterogeneous supercomputer TH-1A. The results show that MD simulations can be efficiently parallelized with our TLPS scheme and can benefit from the optimizations.
机译:具有其节点包含一种以上类型的计算单元(例如,中央处理单元(CPU)和图形处理单元(CPU))的异构系统由于其低成本和高性能而变得流行。在本文中,我们为异构系统上的分子动力学(MD)模拟开发了三级并行化方案(TLPS)。该方案利用了多级并行度,其结合了(1)通过消息传递使用空间分解的节点间并行度,(2)通过动态调度的多线程使用空间分解的节点内并行度以及(3)使用多线程和CPU中的短向量扩展,并在CPU中使用多个CUDA线程。通过将并行性层次结构与优化(例如,隐藏节点内部的通信以及CPU和CPU中的内存优化)一起使用,我们已经在Petascale异构超级计算机TH-1A上实现并评估了MD仿真。结果表明,MD仿真可以与我们的TLPS方案有效地并行化,并且可以从优化中受益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号