...
首页> 外文期刊>Journal of supercomputing >Hybrid and 4-D FFT implementations of an open-source parallel FFT package OpenFFT
【24h】

Hybrid and 4-D FFT implementations of an open-source parallel FFT package OpenFFT

机译:开源并行FFT软件包OpenFFT的混合和4-D FFT实现

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The fast Fourier transform (FFT) is a fundamental kernel in a wide variety of science and engineering fields from electronic structure calculations to medical imaging. OpenFFT is an open-source package for parallel 3-D FFTs, built on a domain decomposition method with optimal communication for minimizing the volume of communication. However, OpenFFT version 1.1 does not support hybrid calculations for fully utilizing common multi-core machines and parallel 4-D FFTs. In addition, there exist several state-of-the-art open-source packages for parallel FFTs, and performance comparison among them would be interesting and helpful for potential users. In this paper, we (1) extend the functionality of OpenFFT by developing a hybrid MPI/OpenMP version and a parallel 4-D FFT, and (2) conduct performance comparison among the currently available parallel FFT packages. In the former, we first analyze the computational parts of OpenFFT to explore the opportunities for taking advantage of fine-grained parallelization with OpenMP. Based on the analysis, we implement and make comparison to choose between the two most promising hybrid options. We then develop the parallel 4-D FFT by extending the hybrid implementation of the parallel 3-D FFT. The decomposition method for 4-D FFTs preserves its 3-D original features to maximize the localization of data when transposing. In the latter, we evaluate and compare the performance of FFTE, P3DFFT, PFFT, 2DECOMP& FFT, and OpenFFT for both 3-D and 4-D FFTs on a number of different machines with varied computational scales. The evaluation results assert the benefit of the hybrid feature in improving the scalability of OpenFFT, and confirm its minimal volume of communication for 4-D FFTs in practice. Also, although no significant difference is observed in overall performance in general, there are specific cases when some packages have the edge over the others.
机译:快速傅立叶变换(FFT)是从电子结构计算到医学成像等众多科学和工程领域的基本内核。 OpenFFT是用于并行3-D FFT的开源软件包,它基于具有最佳通信的域分解方法构建,以最大程度地减少通信量。但是,OpenFFT 1.1版不支持混合计算以充分利用常见的多核计算机和并行4-D FFT。此外,还有一些用于并行FFT的最先进的开源程序包,它们之间的性能比较将很有趣,并且对潜在用户很有帮助。在本文中,我们(1)通过开发混合MPI / OpenMP版本和并行4-D FFT扩展了OpenFFT的功能,并且(2)在当前可用的并行FFT包之间进行性能比较。在前者中,我们首先分析OpenFFT的计算部分,以探索利用OpenMP进行细粒度并行化的机会。基于分析,我们实现并进行比较以在两个最有前途的混合选项之间进行选择。然后,我们通过扩展并行3-D FFT的混合实现来开发并行4-D FFT。 4-D FFT的分解方法保留了其3-D原始功能,以在转置时最大化数据的定位。在后者中,我们评估和比较了在具有不同计算规模的许多不同机器上针对3-D和4-D FFT的FFTE,P3DFFT,PFFT,2DECOMP&FFT和OpenFFT的性能。评估结果证明了混合功能在提高OpenFFT的可伸缩性方面的优势,并在实践中确认了其对于4-D FFT的最小通信量。另外,尽管总体上没有观察到明显的整体性能差异,但是在某些情况下某些包装的包装比其他包装有优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号