Conference: International Symposium on Computer Architecture

Debunking the 100X GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU

Abstract

Recent advances in computing have led to an explosion in the amount of data being generated. Processing this ever-growing data in a timely manner has made throughput computing an important aspect of emerging applications. Our analysis of a set of important throughput computing kernels shows that these kernels contain ample parallelism, which makes them suitable for today's multi-core CPUs and GPUs. In the past few years, many studies have claimed that GPUs deliver substantial speedups (between 10X and 1000X) over multi-core CPUs on these kernels. To understand where such a large performance difference comes from, we perform a rigorous performance analysis and find that, after applying optimizations appropriate for both CPUs and GPUs, the performance gap between an Nvidia GTX280 processor and the Intel Core i7 960 processor narrows to only 2.5X on average. In this paper, we discuss optimization techniques for both CPU and GPU, analyze which architectural features contributed to the performance differences between the two architectures, and recommend a set of architectural features that provide significant improvement in architectural efficiency for throughput kernels.