首页> 外文会议>2017 IEEE 19th International Conference on High Performance Computing and Communications Workshops >Portable and Vendor-Independent Low-Level Programming and Performance Benchmarking for Graphics Cards and Processors
【24h】

Portable and Vendor-Independent Low-Level Programming and Performance Benchmarking for Graphics Cards and Processors

机译:图形卡和处理器的可移植且独立于供应商的低级编程和性能基准测试

获取原文
获取原文并翻译 | 示例

摘要

GPUs have the potential to speed up programs significantly and are one opportunity to increase the scientific reach of compute-intense scientific applications. Several new programming models based on C and other languages have evolved to leverage the potential of such parallel architectures. Still, the development of individual source code versions using different languages and APIs deteriorates the maintainability of the code. It can also lead to slightly different outputs complicating the verification of the results. For comparing the compute performance, the different nature of the processors, different pricing, and different speed grades of hardware must be taken into account. In this paper, we summarize our experience from adapting a set of applications to GPUs. We present several cases how we implement generic code for multiple architectures and how we overcome the challenges that occurred. The presented applications encompass an algorithm to reconstruct the trajectories of particles for the ALICE High Level Trigger at the Large Hadron Collider at CERN; the Linpack benchmark used for ranking the performance of supercomputers and in particular its matrix multiplication substep; Reed-Solomon based failure erasure coding for redundant data storage; Lattice Quantum Chromo Dynamics computations; and an application for evaluating electron microscopy images.
机译:GPU具有显着加快程序速度的潜力,并且是扩大计算密集型科学应用程序的科学范围的机会之一。已经开发了几种基于C和其他语言的新编程模型,以利用这种并行体系结构的潜力。但是,使用不同语言和API的单个源代码版本的开发会降低代码的可维护性。它还可能导致输出略有不同,从而使结果验证变得复杂。为了比较计算性能,必须考虑处理器的不同性质,不同的价格以及不同的硬件速度等级。在本文中,我们总结了将一组应用程序适应GPU的经验。我们将介绍几种情况,说明如何为多种架构实现通用代码以及如何克服所遇到的挑战。提出的应用程序包含一种算法,可为CERN的大型强子对撞机的ALICE高水平触发器重建粒子的轨迹; Linpack基准用于对超级计算机的性能进行排名,尤其是对其矩阵乘法子步骤进行排名;基于Reed-Solomon的故障擦除编码,用于冗余数据存储;晶格量子色动力学计算;以及用于评估电子显微镜图像的应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号