首页> 外文会议>International Conference on Ubiquitous Information Management and Communication >Performance Enhancement of GPU Parallel Computing Using Memory Allocation Optimization
【24h】

Performance Enhancement of GPU Parallel Computing Using Memory Allocation Optimization

机译:使用内存分配优化的GPU并行计算性能提升

获取原文

摘要

The Fourier transform converts a signal from its original domain to a representation in the frequency domain. Applications of the Fourier Transform are far-reaching, spanning fields such as intelligent information processing, machine vision, physics, mathematics, medical science, and telecommunications; hence, its applications have become an indispensable part in our daily life. Therefore, it is essential to construct efficient and high-reliability schemes to guarantee smooth performance of the systems using Fourier Transforms. This study compares performances of Fast Fourier Transforms on a host CPU, GPU parallel computing, and GPU parallel computing with memory allocation optimization. From the experimental results, GPU parallel computing is proven to be effective in enhancing computation speed of the FFT; the speedup ratio of GPU parallel computing over the CPU can reach 48 when operating on 32678 8-byte complex input data. In addition, by optimizing GPU memory allocation, the computation speed of the FFT can be further enhanced; the speedup ratio of GPU parallel computing with memory allocation optimization over the CPU can reach 114.7 when operating on 32678 8-byte complex input data.
机译:傅立叶变换将来自原始域的信号转换为频域中的表示。傅里叶变换的应用深远,跨越字段,如智能信息处理,机器视觉,物理,数学,医学和电信;因此,其应用成为我们日常生活中不可或缺的部分。因此,必须构建有效和高可靠性方案,以保证使用傅里叶变换的系统的平滑性能。该研究比较了在主CPU,GPU并行计算和GPU并行计算的快速傅里叶变换的性能与内存分配优化。从实验结果中,经过证明GPU并联计算可有效提高FFT的计算速度;在32678 8字节复杂的输入数据上运行时,CPU的GPU并行计算的加速比率可以达到48。另外,通过优化GPU存储器分配,可以进一步增强FFT的计算速度;在32678 8字节复杂输入数据上运行时,GPU与CPU的存储器分配优化的Speedup比率可以达到114.7。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号