首页> 外文学位 >Efficient communication algorithms for parallel computing platforms.
【24h】

Efficient communication algorithms for parallel computing platforms.

机译:适用于并行计算平台的高效通信算法。

获取原文
获取原文并翻译 | 示例

摘要

High Performance Computing (HPC) platforms are used for various applications. In these platforms, processor speed has increased rapidly. However, data communication speed among processor, memory, and disk has not kept pace. Thus, efficient communication algorithms are critical for effective utilization of HPC platforms.; Our work focuses on the design of efficient communication algorithms on HPC platforms. To design efficient communication algorithms, we first design a simple and accurate model of HPC platforms. We identify three main costs on HPC platforms: processor-processor, memory-disk, and processor-memory communication costs. Among these, we investigate the first two.; We design communication algorithms using the above model. First, we develop a set of communication algorithms for the software task pipeline which consists of several stages of processors. The general communication for the software task pipeline is M-to-N K-block-cyclic communication, where M is the number of source processors, N is the number of destination processors, and K is the number of consecutive blocks that need to be sent to the same processor. Our algorithm for the communication reduces the number of communication steps to as small as lg(N/M + 1) whereas a previous serial communication takes MN communication steps. Our experimental results show that the number of processors required to process Synthetic Aperture Radar (SAR) data is reduced by as much as 50%.; The second class of algorithms is memory-disk communication algorithms. In this research, several algorithms for memory-disk communications are designed: all-to-all broadcast communication and matrix transpose. The results show that the execution time of a matrix transpose on IBM SP2 is reduced by as much as 31.2% when the data size is 64 MBytes and the number of processors is one. The execution time of the all-to-all broadcast communication is reduced by as much as 86% on SGI/Cray T3E when the number of processors is 64 and the data size is 256 KBytes per processor.; Finally, several benchmarks that measure HPC performance are implemented. We choose a recently proposed benchmark to measure the real-time performance. We implement it using our communication algorithms and the previous serial algorithm. Also, we implement our low-level benchmark on HPC platforms.
机译:高性能计算(HPC)平台用于各种应用程序。在这些平台中,处理器速度迅速提高。但是,处理器,内存和磁盘之间的数据通信速度并未保持同步。因此,有效的通信算法对于有效利用HPC平台至关重要。我们的工作重点是在HPC平台上设计高效的通信算法。为了设计有效的通信算法,我们首先设计一个简单而准确的HPC平台模型。我们确定了HPC平台上的三个主要成本:处理器-处理器,内存磁盘和处理器-内存通信成本。其中,我们调查了前两个。我们使用以上模型设计通信算法。首先,我们为软件任务管道开发了一套通信算法,该算法由多个阶段的处理器组成。软件任务管道的常规通信是 M -to- N K -块循环通信,其中 M 是源处理器的数量, N 是目标处理器的数量, K 是需要发送到同一处理器的连续块的数量。我们的通信算法将通信步骤的数量减少到lg( N / M + 1),而先前的串行通信则采用 MN 交流步骤。我们的实验结果表明,处理合成孔径雷达(SAR)数据所需的处理器数量减少了多达50%。第二类算法是存储磁盘通信算法。在这项研究中,设计了几种用于存储磁盘通信的算法:全广播广播通信和矩阵转置。结果表明,当数据大小为64 MB且处理器数量为1时,在IBM SP2上转置矩阵的执行时间减少了多达31.2%。当处理器数量为64个且每个处理器的数据大小为256 KB时,在SGI / Cray T3E上,所有广播广播的执行时间最多减少86%。最后,实施了一些衡量HPC性能的基准。我们选择了最近提出的基准来衡量实时性能。我们使用我们的通信算法和以前的串行算法来实现它。此外,我们在HPC平台上实施了低级基准测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号