首页> 外文会议>International Conference on P2P, Parallel, Grid, Cloud and Internet Computing >A Study of Memory Consumption and Execution Performance of the cuFFT Library
【24h】

A Study of Memory Consumption and Execution Performance of the cuFFT Library

机译:cuFFT库的内存消耗和执行性能的研究

获取原文

摘要

The Fast Fourier Transform (FFT) is an essential primitive that has been applied in various fields of science and engineering. In this paper, we present a study of the Nvidia's cuFFT library - a proprietary FFT implementation for Nvidia's Graphics Processing Units - to identify the impact that two configuration parameters have in its execution. One useful feature of the cuFFT library is that it can be used to efficiently calculate several FFTs at once. In this work we analyse the effect this feature has on memory consumption and execution time in order to find a useful trade-off. Another important feature of the library is that it supports sophisticated input and output data layouts. This feature allows, for instance, to perform multidimensional FFT decomposition with no need of data transpositions. We have identified some patterns which may help to decide the parameters and values that are the key for achieving increased performance in a FFT calculation. We believe that this study will help researchers who wish to use the cuFFT library to decide what parameters values are best suited to achieve higher performance in their execution, both in time and memory consumption.
机译:快速傅立叶变换(FFT)是已在科学和工程学的各个领域中应用的基本原语。在本文中,我们对Nvidia的cuFFT库(一种用于Nvidia图形处理单元的专有FFT实现)进行了研究,以确定两个配置参数对其执行的影响。 cuFFT库的一个有用的功能是,它可以用于一次高效地计算多个FFT。在这项工作中,我们分析了此功能对内存消耗和执行时间的影响,以便找到有用的折衷方案。该库的另一个重要功能是它支持复杂的输入和输出数据布局。例如,此功能允许执行多维FFT分解而无需数据转置。我们已经确定了一些模式,这些模式可能有助于确定参数和值,这些参数和值是在FFT计算中实现更高性能的关键。我们相信,这项研究将帮助希望使用cuFFT库的研究人员确定哪些参数值最适合在时间和内存消耗方面实现更高的执行性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号