Debunking the 100X GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU

机译：揭示100x GPU与CPU神话：对CPU和GPU的吞吐量计算评估

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent advances in computing have led to an explosion in the amount of data being generated. Processing the ever-growing data in a timely manner has made throughput computing an important aspect for emerging applications. Our analysis of a set of important throughput computing kernels shows that there is an ample amount of parallelism in these kernels which makes them suitable for today's multi-core CPUs and GPUs. In the past few years there have been many studies claiming GPUs deliver substantial speedups (between 10X and 1000X) over multi-core CPUs on these kernels. To understand where such large performance difference comes from, we perform a rigorous performance analysis and find that after applying optimizations appropriate for both CPUs and GPUs the performance gap between an Nvidia GTX280 processor and the Intel Core i7 960 processor narrows to only 2.5x on average. In this paper, we discuss optimization techniques for both CPU and GPU, analyze what architecture features contributed to performance differences between the two architectures, and recommend a set of architectural features which provide significant improvement in architectural efficiency for throughput kernels.

机译：最近的计算进步导致正在生成的数据量的爆炸。以及时的方式处理不断增长的数据使吞吐量计算出新应用的一个重要方面。我们对一系列重要吞吐量计算内核的分析表明，这些内核中存在充足的平行性，这使得它们适用于今天的多核CPU和GPU。在过去的几年里，许多索赔GPU的研究已经在这些内核上的多核CPU上提供了大量的Speedups（10x和1000x）。要了解这么大的性能差异来自哪里，我们执行严格的性能分析，并发现在适用于CPU和GPU的优化后，NVIDIA GTX280处理器和英特尔核心I7 960处理器之间的性能差距平均仅为2.5倍。。在本文中，我们讨论了CPU和GPU的优化技术，分析了两个架构之间的架构功能，并推荐一组体系结构特征，这些功能在吞吐量内核的架构效率方面提供了显着提高。

著录项

来源
《International symposium on computer architecture》|2010年||共10页
会议地点
作者
Victor W Lee; Changkyu Kim; Jatin Chhugani; Michael Deisher; Daehyun Kim; Anthony D. Nguyen; Nadathur Satish; Mikhail Smelyanskiy; Srinivas Chennupaty; Per Hammarlund; Ronak Singhal; Pradeep Dubey;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类总体结构、系统结构;
关键词
cpu architecture; gpu architecture; performance analysis; performance measurement; software optimization; throughput computing;

机译：CPU架构;GPU架构;绩效分析;性能测量;软件优化;吞吐量计算;

相似文献

外文文献
中文文献
专利

1. Debunking the 100X GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU [J] . Victor W Lee, Changkyu Kim, Jatin Chhugani, Computer architecture news . 2010,第3期

机译：揭穿100X GPU与CPU神话：揭秘CPU和GPU上的吞吐量计算
2. GPUs for statistical data analysis in HEP: a performance study of GooFit on GPUs vs. RooFit on CPUs [J] . Alexis Pompili, Adriano Di Florio, CMS Collaboration). Journal of Physics: Conference Series . 2016,第1期

机译：用于HEP中的统计数据分析的GPU：GPU上GooFit与CPU上RooFit的性能研究
3. CPUs vs. GPUs [J] . Fortune . 2010,第3期

机译：CPU与GPU
4. Debunking the 100X GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU [C] . Victor W Lee, Changkyu Kim, Jatin Chhugani, 37th annual international symposium on computer architecture 2010 . 2010

机译：揭穿100X GPU与CPU神话：揭秘CPU和GPU上的吞吐量计算
5. On Efficient GPGPU Computing for Integrated Heterogeneous CPU-GPU Microprocessors [D] . Gerzhoy, Daniel. 2021

机译：关于集成异构CPU-GPU微处理器的高效GPGPU计算
6. Tempest: GPU-CPU computing for high-throughput database spectral matching [O] . Jeffrey A. Milloy, Brendan K. Faherty, Scott A. Gerber -1

机译：Tempest：高吞吐量数据库谱匹配的GPU-CPU计算
7. FPGA vs. Multi-Core CPUs vs. GPUs: Hands-on Experience with a Sorting Application [O] . Cristian Grozea, Zorana Bankovic, Pavel Laskov 2011

机译：FpGa与多核CpU与GpU的对比：使用分类应用程序的实践经验

Debunking the 100X GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU

摘要

著录项

相似文献

相关主题

期刊订阅