首页> 外文会议>Architecture of computing systems - ARCS 2012. >An Approach for Performance Estimation of Hybrid Systems with FPGAs and GPUs as Coprocessors
【24h】

An Approach for Performance Estimation of Hybrid Systems with FPGAs and GPUs as Coprocessors

机译:FPGA和GPU作为协处理器的混合系统性能评估方法

获取原文
获取原文并翻译 | 示例

摘要

This paper presents an approach for modeling the achievable speed-ups of FPGAs (Field Programmable Gate Arrays) or GPUs (Graphic Processing Units) as coprocessors in hybrid computing systems. The underlying computation model assumes that the coprocessors are separate devices and that their input and output data are transferred from and into the system's memory. The model considers all overheads involved when (sub-)tasks are performed on a coprocessor instead of the CPU. By means of a sample application the validity of the model is checked against measured values. In addition, the theoretical maximum speed-ups of two hybrid systems compared to an optimal single core CPU implementation are approximated. Using penalty factor Pseq as a measure to which degree a program cannot be fully parallelized due to data dependencies, a system with a Nvidia GTX 285 GPU achieves a speed-up of 2.7 times Pseq, while for a single node of a Cray XDI with a Xilinx Virtex4 LX160 the speed-up is about 1 times P_(seq).
机译:本文提出了一种在混合计算系统中对作为协处理器的FPGA(现场可编程门阵列)或GPU(图形处理单元)可达到的加速进行建模的方法。基本的计算模型假定协处理器是单独的设备,并且它们的输入和输出数据是从系统内存中传入和传出的。该模型考虑了在协处理器而不是CPU上执行(子)任务时涉及的所有开销。通过样品应用,对照测量值检查模型的有效性。此外,与最佳单核CPU实施相比,两个混合系统在理论上的最大提速是近似的。使用惩罚因子Pseq作为衡量程序由于数据依赖性而无法完全并行化的程度,具有Nvidia GTX 285 GPU的系统的Pseq提升了2.7倍,而对于Cray XDI的单个节点, Xilinx Virtex4 LX160的加速约为P_(seq)的1倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号