首页> 美国卫生研究院文献>other >CUDA Optimization Strategies for Compute- and Memory-Bound Neuroimaging Algorithms

【2h】

CUDA Optimization Strategies for Compute- and Memory-Bound Neuroimaging Algorithms

机译：CUDA优化策略用于计算和内存内存的神经影像算法

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

As neuroimaging algorithms and technology continue to grow faster than CPU performance in complexity and image resolution, data-parallel computing methods will be increasingly important. The high performance, data-parallel architecture of modern graphical processing units (GPUs) can reduce computational times by orders of magnitude. However, its massively threaded architecture introduces challenges when GPU resources are exceeded. This paper presents optimization strategies for compute- and memory-bound algorithms for the CUDA architecture. For compute-bound algorithms, the registers are reduced through variable reuse via shared memory and the data throughput is increased through heavier thread workloads and maximizing the thread configuration for a single thread block per multiprocessor. For memory-bound algorithms, fitting the data into the fast but limited GPU resources is achieved through reorganizing the data into self-contained structures and employing a multi-pass approach. Memory latencies are reduced by selecting memory resources whose cache performance are optimized for the algorithm's access patterns. We demonstrate the strategies on two computationally expensive algorithms and achieve optimized GPU implementations that perform up to 6× faster than unoptimized ones. Compared to CPU implementations, we achieve peak GPU speedups of 129× for the 3D unbiased nonlinear image registration technique and 93× for the non-local means surface denoising algorithm.

著录项

期刊名称 other
作者
Daren Lee; Ivo Dinov; Bin Dong; Boris Gutman; Igor Yanovsky; Arthur W. Toga;
展开▼
作者单位

展开▼
年(卷),期 -1(106),3
年度 -1
页码 175–187
总页数 25
原文格式 PDF
正文语种
中图分类
关键词
Graphics Processing Unit (GPU) Performance Optimization Compute-bound Memory-bound CUDA Fermi Neuroimaging;

机译：图形处理单元（GPU）;性能优化;计算绑定;内存绑定;CUDA;费米;神经成像;

相似文献

外文文献
中文文献
专利

1. CUDA optimization strategies for compute- and memory-bound neuroimaging algorithms [J] . LeeD., DinovI., DongB., Computer Methods and Programs in Biomedicine: An International Journal Devoted to the Development, Implementation and Exchange of Computing Methodology and Software Systems in Biomedical Research and Medical Practice . 2012,第3期

机译：用于计算和内存绑定神经成像算法的CUDA优化策略
2. Algorithmic and language-based optimization of Marsa-LFIB4 pseudorandom number generator using OpenMP, OpenACC and CUDA [J] . Przemyslaw Stpiczynski Journal of Parallel and Distributed Computing . 2020,第Mara期

机译：使用OpenMP，OpenACC和CUDA的Marsa-LFIB4伪随机数发生器的算法和基于语言的优化
3. Evaluation of parallel particle swarm optimization algorithms within the CUDA? architecture [J] . Mussi L., Daolio F., Cagnoni S. Information Sciences: An International Journal . 2011,第20期

机译：评估CUDA中的并行粒子群优化算法？建筑
4. Algorithmic strategies for optimizing the parallel reduction primitive in CUDA [C] . Martin Pedro J., Ayuso Luis F., Torres Roberto, 2012 International Conference on High Performance Computing amp; Simulation . 2012

机译：在CUDA中优化并行约简原语的算法策略
5. Optimization techniques for mapping algorithms and applications onto CUDA GPU platforms and CPU-GPU heterogeneous platforms. [D] . Wu, Jing. 2014

机译：用于将算法和应用程序映射到CUDA GPU平台和CPU-GPU异构平台的优化技术。
6. Strategies for optimizing the phase correction algorithms in Nuclear MagneticResonance spectroscopy [O] . Franciszek Binczyk, Rafal Tarnawski, Joanna Polanska 2015

机译：优化核磁相位校正算法的策略共振光谱
7. CUDA optimization strategies for compute- and memory-bound neuroimaging algorithms [O] . Daren Lee, Ivo Dinov, Bin Dong, 2012

机译：CUDA优化策略，用于计算和内存内存的神经影像算法
8. Online Build-Order Optimization for Real-Time Strategy Agents using Multi-Objective Evolutionary Algorithms. [R] . J. M. Blackford 2014

机译：使用多目标进化算法的实时战略代理的在线构建顺序优化。

CUDA Optimization Strategies for Compute- and Memory-Bound Neuroimaging Algorithms

摘要

著录项

相似文献

相关主题

期刊订阅