GPU and FPGA Coprocessors for Data Intensive Computations.

机译：用于数据密集型计算的GPU和FPGA协处理器。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the current norm of multi-core processors, stagnant clock rates, and slowing gains from instruction level parallelism, it has become increasingly important to exploit parallelism in order to achieve acceptable performance for data intensive tasks. While multi-core processors are fine for exploiting thread-level parallelism, they are often a suboptimal choice for problems that exhibit abundant data parallelism. This thesis investigates the application of Graphics Processing Units (GPUs) and Field Programmable Gate Array (FPGA) coprocessors for data intensive, data parallel workloads.;Since adopting a unified shader architecture and a general programming model, GPUs have become an increasingly important alternative to general-purpose processors for compute intensive applications, since they feature peak floating-point performance well above that of general-purpose processors. We investigate GPU coprocessors for a simple particle simulation and demonstrate the performance benefit of offloading spatial transformations and basic particle motion calculations to a GPU. We also study a GPU coprocessor for the k-Means clustering algorithm and demonstrate application speedups of 40-70x.;FPGAs are hardware devices capable of implementing arbitrary digital circuits. The vast internal bandwidth and low power consumption afforded by these devices makes them an attractive target for certain data parallel workloads. We investigate FPGA architecture for Decision Tree Classification that can achieve a speedup of 30x for the split determination phase of the algorithm. We also present a fast pairwise statistical significance estimation architecture using an FPGA coprocessor that offloads the alignment task to an accelerator designed to concurrently process multiple independent alignments, resulting in an end-to-end speedup of over 200x over a baseline software implementation.

机译：随着当前多核处理器规范的发展，时钟速率的停滞以及指令级并行性的缓慢增长，利用并行性以实现数据密集型任务的可接受性能变得越来越重要。尽管多核处理器可以很好地利用线程级并行性，但对于表现出大量数据并行性的问题，它们通常不是最佳选择。本文研究了图形处理单元（GPU）和现场可编程门阵列（FPGA）协处理器在数据密集型，数据并行工作负载中的应用。由于采用统一的着色器体系结构和通用的编程模型，GPU成为了越来越重要的替代产品用于计算密集型应用程序的通用处理器，因为它们的峰值浮点性能远高于通用处理器。我们研究了用于简单粒子模拟的GPU协处理器，并演示了将空间变换和基本粒子运动计算卸载到GPU上的性能优势。我们还研究了用于k-Means聚类算法的GPU协处理器，并演示了40-70x的应用加速。FPGA是能够实现任意数字电路的硬件设备。这些设备提供的巨大内部带宽和低功耗使其成为某些数据并行工作负载的有吸引力的目标。我们研究了用于决策树分类的FPGA体系结构，该体系结构在算法的拆分确定阶段可以实现30倍的加速。我们还提出了一种使用FPGA协处理器的快速成对统计显着性估计架构，该协处理器将比对任务分流到设计用于同时处理多个独立比对的加速器中，从而使端到端的速度比基线软件实现高200倍以上。

著录项

作者
Honbo, Daniel.;
展开▼
作者单位

Northwestern University.;

展开▼
授予单位 Northwestern University.;
学科 Computer engineering.
学位 Ph.D.
年度 2014
页码 64 p.
总页数 64
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. FPGAs versus GPUs in Data centers [J] . Babak Falsafi, Bill Dally, Desh Singh, IEEE Micro . 2017,第1期

机译：数据中心中的FPGA与GPU
2. Using Data Compression for Increasing Efficiency of Data Transfer Between Main Memory and Intel Xeon Phi Coprocessor or NVidia GPU in Parallel DBMS [J] . Konstantin Y. Besedin, Pavel S. Kostenetskiy, Stepan O. Prikazchikov Procedia Computer Science . 2015,第1期

机译：使用数据压缩来提高并行DBMS中主内存与Intel Xeon Phi协处理器或NVidia GPU之间的数据传输效率
3. FPGA implementation of a wireless sensor node with built-in security coprocessors for secured key exchange and data transfer [J] . Toubal Abdelmoughni, Bengherbia Billel, Zmirli Mohamed Ould, Measurement . 2020,第期

机译：FPGA实现具有内置安全协处理器的无线传感器节点，用于安全密钥交换和数据传输
4. An Approach for Performance Estimation of Hybrid Systems with FPGAs and GPUs as Coprocessors [C] . Volker Hampel, Thilo Pionteck, Erik Maehle Architecture of computing systems - ARCS 2012. . 2012

机译：FPGA和GPU作为协处理器的混合系统性能评估方法
5. Accelerating molecular docking and binding site mapping using FPGAs and GPUs. [D] . Sukhwani, Bharat. 2011

机译：使用FPGA和GPU加速分子对接和结合位点定位。
6. Optimizing Data Intensive GPGPU Computations for DNA Sequence Alignment [O] . Cole Trapnell, Michael C. Schatz -1

机译：优化DNA序列对齐的数据密集型GPGPU计算
7. Using Data Compression for Increasing Efficiency of Data Transfer Between Main Memory and Intel Xeon Phi Coprocessor or NVidia GPU in Parallel DBMS [O] . Besedin Konstantin Y., Kostenetskiy Pavel S., Prikazchikov Stepan O. 2015

机译：使用数据压缩来提高并行DBMS中主内存与Intel Xeon Phi协处理器或NVidia GPU之间的数据传输效率

GPU and FPGA Coprocessors for Data Intensive Computations.

摘要

著录项

相似文献

相关主题

期刊订阅