Conference: Supercomputing, 1998. SC98. IEEE/ACM Conference on

Accurate Performance Evaluation, Modelling and Prediction of a Message Passing Simulation Code based on Middleware



Abstract

In distributed and vectorized computing, an application could run on a large number of widely differing supercomputing platforms. Yet most traditional parallel codes are ill-equipped to collect data about their resource usage or their run-time behavior, the corresponding data are rarely published, and few scientists plan an application and its platform systematically. As an improvement over the current state of the art, we propose an integrated approach to performance evaluation, modeling, and prediction for different platforms. Our approach combines analytical modeling with systematically designed experiments on full application runs, reduced application kernels, and some benchmarks. We studied our performance assessment methodology with Opal, an example code in molecular biology developed at our institution to run on our four Cray J90 "Classic" vector SMPs. Besides a detailed assessment of the performance achieved on the J90s, the primary goal of our study was to find the most suitable and most cost-effective hardware platform for the application, and in particular to check its suitability for slow CoPs, SMP CoPs, and fast CoPs, three flavors of Clusters of PCs (CoPs) built with off-the-shelf Intel Pentium processors. A performance assessment based on our model is much easier than porting and parallelizing the application for a new target machine, so we could easily obtain and include performance estimates for a T3E-900, a high-end MPP system. The predicted execution times and speedup figures indicate that a well-designed cluster of PCs achieves similar, if not better, performance than the J90 vector processors currently used, and that the computational efficiency compares favorably to the T3E-900 for that particular application code.
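
The abstract does not spell out the analytical model itself. As a rough illustration of the general idea only, the sketch below predicts execution time and speedup from a handful of platform parameters using a generic latency/bandwidth cost model. This is an assumed textbook-style model, not the authors' model for Opal; the `Platform` fields, the function names, and all numeric values are hypothetical and are not measurements of any real machine.

```python
from dataclasses import dataclass


@dataclass
class Platform:
    """Hypothetical platform description (illustrative values only)."""
    name: str
    flops_per_sec: float            # sustained compute rate per processor
    latency_sec: float              # fixed cost per message
    bandwidth_bytes_per_sec: float  # transfer rate per link


def predicted_time(platform, p, total_flops, msgs_per_proc, bytes_per_msg):
    """Predicted run time on p processors under the assumed model.

    Compute work is divided evenly across p processors; each processor
    additionally pays for its own messages (latency + size / bandwidth).
    A single processor is assumed to need no communication.
    """
    compute = total_flops / (p * platform.flops_per_sec)
    if p == 1:
        return compute
    per_msg = platform.latency_sec + bytes_per_msg / platform.bandwidth_bytes_per_sec
    return compute + msgs_per_proc * per_msg


def speedup(platform, p, **workload):
    """Ratio of predicted single-processor time to predicted p-processor time."""
    return predicted_time(platform, 1, **workload) / predicted_time(platform, p, **workload)


if __name__ == "__main__":
    # Made-up numbers purely to exercise the functions.
    cluster = Platform("hypothetical cluster", 1e8, 1e-4, 1e7)
    workload = dict(total_flops=1e10, msgs_per_proc=100, bytes_per_msg=1e4)
    for p in (1, 2, 4, 8):
        print(p, predicted_time(cluster, p, **workload), speedup(cluster, p, **workload))
```

In a study of the kind described, the parameters would be calibrated from benchmarks and reduced kernels rather than chosen by hand, and the predictions checked against full application runs.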
