首页> 外文OA文献 >Achieving High Performance on Extremely Large Parallel Machines: Performance Prediction and Load Balancing

【2h】

Achieving High Performance on Extremely Large Parallel Machines: Performance Prediction and Load Balancing

机译：在超大型并行机上实现高性能：性能预测和负载均衡

页面导航

摘要
著录项
相似文献
相关主题

摘要

Parallel machines with an extremely large number of processors (at least tens of thousands processors) are now in operation. For example, the IBM BlueGene/L machine with 128K processors is currently being deployed. It is going to be a significant challenge for application developers to write parallel programs in order to exploit the enormous compute power available and manually scale their applications on such machines. Solving these problems involves finding suitable parallel programming models for such machines and addressing issues like load imbalance. In this thesis, we explore Charm++ programming model and its migratable objects for programming such machines and dynamic load balancing techniques to help parallel applications to easily scale on a large number of processors. We also present a parallel simulator that is capable of predicting parallel performance to help analysis and tuning of the parallel performance and facilitate the development of new load balancing techniques, even before such machines are built.We evaluate the idea of virtualization and its usefulness in helping a programmer to write applications with high degree of parallelism. We demonstrate it by developing several mini-applications with million-way parallelism. We show that Charm++ and AMPI (an extension to MPI) with migratable objects and support for load balancing are suitable programming model for programming very large machines.It is important to understand the performance of parallel applications on very large parallel machines. This thesis explores Parallel Discrete Event Simulation (PDES) techniques with an optimistic synchronization protocol to simulate parallel applications running on a very large number of processors. We optimize the synchronization protocol by exploiting the inherent determinacy that is normally found in parallel applications to reduce the synchronization overhead significantly.

机译：具有大量处理器（至少数万个处理器）的并行计算机现在正在运行。例如，当前正在部署具有128K处理器的IBM BlueGene / L计算机。对于应用程序开发人员而言，编写并行程序以利用可用的巨大计算能力并在此类计算机上手动扩展其应用程序将是一个巨大的挑战。解决这些问题涉及为此类机器找到合适的并行编程模型，并解决诸如负载不平衡之类的问题。在本文中，我们探索了Charm ++编程模型及其可移植对象，以对此类机器和动态负载平衡技术进行编程，以帮助并行应用程序轻松地在大量处理器上进行扩展。我们还提供了一种并行仿真器，该仿真器能够预测并行性能，以帮助分析和调整并行性能，甚至在构建此类负载均衡器之前就促进了新的负载平衡技术的开发。程序员编写高度并行的应用程序。我们通过开发具有百万程并行性的几个小型应用程序来演示它。我们展示了具有可迁移对象的Charm ++和AMPI（对MPI的扩展）以及对负载平衡的支持，是对大型计算机进行编程的合适编程模型。了解大型并行计算机上并行应用程序的性能非常重要。本文探讨了采用乐观同步协议的并行离散事件模拟（PDES）技术，以模拟运行在大量处理器上的并行应用程序。我们利用并行应用程序中通常存在的固有确定性来优化同步协议，以显着减少同步开销。

著录项

作者
Zheng Gengbin;
展开▼
作者单位

展开▼
年度 2005
总页数
原文格式 PDF
正文语种 {"code":"en","name":"English","id":9}
中图分类

相似文献

外文文献
中文文献
专利

1. Performance Evaluation of Dynamic Load Balancing Scheme with Load Prediction Mechanism Using the Load Growing Acceleration for Massively Parallel Computers [J] . Atsushi Nunome, Hiroaki Hirata, Haruo Niimi, Systems and Computers in Japan . 2004,第11期

机译：大规模并行计算机基于负载预测机制的带有负载预测机制的动态负载均衡方案的性能评估
2. A highly OR-parallel inference machine (Multi-ASCA) and its performance evaluation: an architecture and its load balancing algorithms [J] . Naganuma J., Ogura T. IEEE Transactions on Computers . 1994,第9期

机译：高度或并行的推理机（Multi-ASCA）及其性能评估：体系结构及其负载平衡算法
3. A Web Clustered System for Achieving Higher Performances through Load Balancing Mechanism [J] . Young Rhee American Journal of Industrial and Business Management . 2013,第3期

机译：通过负载平衡机制实现更高性能的Web集群系统
4. BigSim: a parallel simulator for performance prediction of extremely large parallel machines [C] . Zheng, G., Gunavardhan Kakulapati, . 2004

机译：BigSim：并行模拟器，用于预测超大型并行机的性能
5. Achieving high performance on extremely large parallel machines: Performance prediction and load balancing. [D] . Zheng, Gengbin. 2005

机译：在超大型并行机上实现高性能：性能预测和负载平衡。
6. Ultra-deep massively parallel sequencing with unique molecular identifier tagging achieves comparable performance to droplet digital PCR for detection and quantification of circulating tumor DNA from lung cancer patients [O] . Le Son Tran, Hong-Anh Thi Pham, Vu-Uyen Tran, 2019

机译：具有独特分子标识符标记的超深度平行测序可与肺癌患者循环肿瘤DNA的检测和定量液滴数字PCR实现了相当的性能
7. Bigsim: A parallel simulator for performance prediction of extremely large parallel machines [O] . Gengbin Zheng, Gunavardhan Kakulapati, Laxmikant V. Kalé 2004

机译：Bigsim：并行模拟器，用于预测超大型并行机的性能

Achieving High Performance on Extremely Large Parallel Machines: Performance Prediction and Load Balancing

摘要

著录项

相似文献

相关主题

期刊订阅