首页> 外文会议>IEEE International Conference on High Performance Computing and Communications Workshops >Portable and Vendor-Independent Low-Level Programming and Performance Benchmarking for Graphics Cards and Processors
【24h】

Portable and Vendor-Independent Low-Level Programming and Performance Benchmarking for Graphics Cards and Processors

机译:便携式和供应商独立的低级编程和显卡和处理器的性能基准测试

获取原文

摘要

GPUs have the potential to speed up programs significantly and are one opportunity to increase the scientific reach of compute-intense scientific applications. Several new programming models based on C and other languages have evolved to leverage the potential of such parallel architectures. Still, the development of individual source code versions using different languages and APIs deteriorates the maintainability of the code. It can also lead to slightly different outputs complicating the verification of the results. For comparing the compute performance, the different nature of the processors, different pricing, and different speed grades of hardware must be taken into account. In this paper, we summarize our experience from adapting a set of applications to GPUs. We present several cases how we implement generic code for multiple architectures and how we overcome the challenges that occurred. The presented applications encompass an algorithm to reconstruct the trajectories of particles for the ALICE High Level Trigger at the Large Hadron Collider at CERN; the Linpack benchmark used for ranking the performance of supercomputers and in particular its matrix multiplication substep; Reed-Solomon based failure erasure coding for redundant data storage; Lattice Quantum Chromo Dynamics computations; and an application for evaluating electron microscopy images.
机译:GPU有可能显着加快节目,并有机会增加计算强烈的科学应用的科学覆盖范围。基于C和其他语言的几种新的编程模型已经发展起来利用这种并行架构的潜力。尽管如此,使用不同语言和API的各个源代码版本的开发劣化了代码的可维护性。它还可以导致略有不同的输出使结果验证复杂化。为了比较计算性能,必须考虑处理器,不同定价和不同速度等级的不同性质。在本文中,我们总结了我们将一组应用程序适应GPU的经验。我们提供了几个案例我们如何为多个架构实施通用代码以及我们如何克服所发生的挑战。所呈现的应用包括一种算法,可以在CERN的大型HADRON撞机中重建粒子的颗粒轨迹; LINPACK基准测试用于排名超级计算机的性能,特别是其矩阵乘法子步骤;基于Reed-Solomon的故障擦除编码,用于冗余数据存储;晶格量子Chromo动力学计算;以及评估电子显微镜图像的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号