首页> 外文期刊>IBM Journal of Research and Development >Scalable framework for 3D FFTs on the Blue Gene/L supercomputer: Implementation and early performance measurements
【24h】

Scalable framework for 3D FFTs on the Blue Gene/L supercomputer: Implementation and early performance measurements

机译:Blue Gene / L超级计算机上3D FFT的可扩展框架:实现和早期性能测量

获取原文
           

摘要

This paper presents results on a communications-intensive kernel, the three-dimensional fast Fourier transform (3D FFT), running on the 2,048-node Blue Gene®/L (BG/L) prototype. Two implementations of the volumetric FFT algorithm were characterized, one built on the Message Passing Interface library and another built on an active packet Application Program Interface supported by the hardware bring-up environment, the BG/L advanced diagnostics environment. Preliminary performance experiments on the BG/L prototype indicate that both of our implementations scale well up to 1,024 nodes for 3D FFTs of size 128 × 128 × 128. The performance of the volumetric FFT is also compared with that of the Fastest Fourier Transform in the West (FFTW) library. In general, the volumetric FFT outperforms a port of the FFTW Version 2.1.5 library on large-node-count partitions.
机译:本文介绍了在通信密集型内核三维快速傅里叶变换(3D FFT)上的结果,该内核在2,048个节点的BlueGene®/ L(BG / L)原型上运行。表征了体积FFT算法的两种实现,一种基于消息传递接口库,另一种基于硬件激活环境BG / L高级诊断环境支持的活动数据包应用程序接口。在BG / L原型上进行的初步性能实验表明,对于大小为128×128×128的3D FFT,我们的两种实现都可以扩展到1,024个节点。在FFT中,还将体积FFT的性能与最快傅立叶变换的性能进行比较。西部(FFTW)库。通常,在大节点数分区上,体积FFT的性能优于FFTW 2.1.5版库的端口。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号