Conference: International Conference on P2P, Parallel, Grid, Cloud and Internet Computing

A Study of Memory Consumption and Execution Performance of the cuFFT Library

Abstract

The Fast Fourier Transform (FFT) is an essential primitive that has been applied in various fields of science and engineering. In this paper, we present a study of Nvidia's cuFFT library, a proprietary FFT implementation for Nvidia's Graphics Processing Units, to identify the impact that two configuration parameters have on its execution. One useful feature of the cuFFT library is that it can efficiently calculate several FFTs at once. In this work we analyse the effect this feature has on memory consumption and execution time in order to find a useful trade-off. Another important feature of the library is its support for sophisticated input and output data layouts. This makes it possible, for instance, to perform multidimensional FFT decomposition without data transpositions. We have identified some patterns that may help in choosing the parameters and values that are key to achieving increased performance in an FFT calculation. We believe this study will help researchers who wish to use the cuFFT library to decide which parameter values are best suited to achieving higher performance, in terms of both execution time and memory consumption.
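
To illustrate how a strided data layout can avoid a transposition, here is a minimal sketch in plain Python. It does not call cuFFT and is not the authors' code: `dft` and `batched_dft` are hypothetical helpers, and a naive DFT stands in for the library. The `stride` and `dist` arguments are assumed to play the role of the `istride` and `idist` parameters of `cufftPlanMany`, with element `x` of signal `b` read from `data[b * dist + x * stride]`.

```python
import cmath


def dft(seq):
    """Naive O(n^2) forward DFT of a sequence of complex numbers."""
    n = len(seq)
    return [
        sum(seq[x] * cmath.exp(-2j * cmath.pi * k * x / n) for x in range(n))
        for k in range(n)
    ]


def batched_dft(data, n, batch, stride, dist):
    """Transform `batch` signals of length `n` stored in the flat list `data`.

    Element x of signal b is read from data[b * dist + x * stride]
    (hypothetical helper mirroring a strided, batched layout).
    """
    return [
        dft([data[b * dist + x * stride] for x in range(n)])
        for b in range(batch)
    ]


# A 4 x 3 matrix stored row-major in one flat buffer.
rows, cols = 4, 3
flat = [complex(r * cols + c, 0) for r in range(rows) for c in range(cols)]

# Row transforms: each signal is contiguous, signals follow one another.
row_ffts = batched_dft(flat, n=cols, batch=rows, stride=1, dist=cols)

# Column transforms: strided reads, so no transposed copy is needed.
col_ffts = batched_dft(flat, n=rows, batch=cols, stride=cols, dist=1)

# Reference: transpose explicitly, then transform contiguous signals.
transposed = [flat[r * cols + c] for c in range(cols) for r in range(rows)]
col_ffts_ref = batched_dft(transposed, n=rows, batch=cols, stride=1, dist=rows)
```

Here `col_ffts` and `col_ffts_ref` hold the same values, but only the second path builds a transposed copy of the data. Transforming the rows and then the columns in this way is one form of the multidimensional decomposition the abstract mentions.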