首页> 外文会议>International Conference on Ubiquitous Information Management and Communication >Performance Enhancement of GPU Parallel Computing Using Memory Allocation Optimization
【24h】

Performance Enhancement of GPU Parallel Computing Using Memory Allocation Optimization

机译:使用内存分配优化增强GPU并行计算的性能

获取原文

摘要

The Fourier transform converts a signal from its original domain to a representation in the frequency domain. Applications of the Fourier Transform are far-reaching, spanning fields such as intelligent information processing, machine vision, physics, mathematics, medical science, and telecommunications; hence, its applications have become an indispensable part in our daily life. Therefore, it is essential to construct efficient and high-reliability schemes to guarantee smooth performance of the systems using Fourier Transforms. This study compares performances of Fast Fourier Transforms on a host CPU, GPU parallel computing, and GPU parallel computing with memory allocation optimization. From the experimental results, GPU parallel computing is proven to be effective in enhancing computation speed of the FFT; the speedup ratio of GPU parallel computing over the CPU can reach 48 when operating on 32678 8-byte complex input data. In addition, by optimizing GPU memory allocation, the computation speed of the FFT can be further enhanced; the speedup ratio of GPU parallel computing with memory allocation optimization over the CPU can reach 114.7 when operating on 32678 8-byte complex input data.
机译:傅立叶变换将信号从其原始域转换为频域表示。傅里叶变换的应用范围很广,涵盖了智能信息处理,机器视觉,物理,数学,医学和电信等领域。因此,其应用已成为我们日常生活中不可或缺的一部分。因此,构建高效且高可靠性的方案以使用傅里叶变换来保证系统的平稳性能至关重要。这项研究比较了快速傅里叶变换在主机CPU,GPU并行计算和具有内存分配优化功能的GPU并行计算上的性能。从实验结果来看,GPU并行计算被证明可以有效地提高FFT的计算速度。当对32678个8字节的复杂输入数据进行操作时,GPU并行计算在CPU上的加速比可以达到48。另外,通过优化GPU内存分配,可以进一步提高FFT的计算速度。当对32678个8字节的复杂输入数据进行操作时,通过CPU进行内存分配优化的GPU并行计算的加速比可以达到114.7。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号