首页> 外文会议>Parallel, Distributed and Network-Based Processing (PDP), 2012 20th Euromicro International Conference on >Optimization Techniques and Performance Analyses of Two Life Science Algorithms for Novel GPU Architectures
【24h】

Optimization Techniques and Performance Analyses of Two Life Science Algorithms for Novel GPU Architectures

机译:新型GPU架构的两种生命科学算法的优化技术和性能分析

获取原文
获取原文并翻译 | 示例

摘要

In this paper we evaluate two life science algorithms, namely Needleman-Wunsch sequence alignment and Direct Coulomb Summation, for GPUs. Whereas for Needleman-Wunsch it is difficult to get good performance numbers, Direct Coulomb Summation is particularly suitable for graphics cards. We present several optimization techniques, analyze the theoretical potential of the optimizations with respect to the algorithms, and measure the effect on execution times. We target the recent NVIDIA Fermi architecture to evaluate the performance impacts of novel hardware features like the cache subsystem on optimizing transformations. We compare the execution times of CUDA and OpenCL code versions for Fermi and predecessor models with parallel OpenMP versions executed on the main CPU.
机译:在本文中,我们评估了GPU的两种生命科学算法,即Needleman-Wunsch序列比对和直接库仑求和。对于Needleman-Wunsch来说,很难获得良好的性能数据,而Direct Coulomb Summation特别适合于图形卡。我们提出了几种优化技术,分析了关于算法的优化的理论潜力,并测量了对执行时间的影响。我们以最新的NVIDIA Fermi架构为目标,以评估诸如缓存子系统之类的新型硬件功能对优化转换的性能影响。我们将Fermi和先前型号的CUDA和OpenCL代码版本的执行时间与在主CPU上执行的并行OpenMP版本进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号