...
首页> 外文期刊>Computers and Electrical Engineering >SCALEABLE PARALLEL MODEL DEVELOPMENT AND PERFORMANCE ANALYSIS FOR MIMD MOLECULAR DYNAMICS SIMULATIONS
【24h】

SCALEABLE PARALLEL MODEL DEVELOPMENT AND PERFORMANCE ANALYSIS FOR MIMD MOLECULAR DYNAMICS SIMULATIONS

机译:分子动力学模拟的可伸缩并行模型开发和性能分析

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

A scaleable parallel model developed specifically for molecular dynamics simulations is described. The description encompasses both hardware and software. It is mapped onto a transputer array, programmed, and used to verify concepts and gather experimental data. Simulations are run with sample sizes ranging from 1024 to 42,546 particles on arrays from 4 to 128 processors. Performance data is generated by fixing the sample size, changing the number of node processors used to run the simulation, and measuring execution speeds. The parallel implementation is dissected and used to develop a detailed mathematical performance model. The methodology for developing the mathematical model is described. Performance equations are derived for all computation and communication routines in the code, and combined to form a modular system performance estimator. Accuracy of the mathematical model is verified by comparing the measured baseline times against a set of performance estimation curves generated by the mathematical formulation. The computation and communication constraints to faster execution are identified and discussed, including the issues of load balancing, data partitioning strategy, and upper and lower limits on the number of processors that should be used for a simulation. Algorithm and architectural improvements are described, and their impact on performance predicted using the analytical tools developed.
机译:描述了专门为分子动力学模拟开发的可缩放并行模型。该描述包含硬件和软件。它被映射到晶片机阵列上,进行了编程,并用于验证概念并收集实验数据。在4到128个处理器的阵列上,使用样本大小范围从1024到42,546的粒子运行模拟。通过固定样本大小,更改用于运行模拟的节点处理器的数量以及测量执行速度来生成性能数据。剖析了并行实现,并将其用于开发详细的数学性能模型。描述了开发数学模型的方法。针对代码中的所有计算和通信例程推导了性能方程,并将其组合起来以形成模块化的系统性能估计器。通过将测得的基准时间与数学公式生成的一组性能估计曲线进行比较,可以验证数学模型的准确性。确定并讨论了加快执行的计算和通信约束,包括负载平衡,数据分区策略以及用于仿真的处理器数量的上限和下限。描述了算法和体系结构的改进,并使用开发的分析工具预测了它们对性能的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号