Complexity Analysis and Algorithm Design for Reorganizing Data to Minimize Non-Coalesced Memory Accesses on GPU

Bo Wu; Zhijia Zhao; Eddy Z. Zhang; Yunlian Jiang; Xipeng Shen

首页> 外文期刊>ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages >Complexity Analysis and Algorithm Design for Reorganizing Data to Minimize Non-Coalesced Memory Accesses on GPU

【24h】

Complexity Analysis and Algorithm Design for Reorganizing Data to Minimize Non-Coalesced Memory Accesses on GPU

机译：数据重组以最大程度减少GPU上的非批量内存访问的复杂度分析和算法设计

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The performance of Graphic Processing Units (GPU) is sensitive to irregular memory references. Some recent work shows the promise of data reorganization for eliminating non-coalesced memory accesses that are caused by irregular references. However, all previous studies have employed simple, heuristic methods to determine the new data layouts to create. As a result, they either do not provide any performance guarantee or are effective to only some limited scenarios. This paper contributes a fundamental study to the problem. It systematically analyzes the inherent complexity of the problem in various settings, and for the first time, proves that the problem is NP-complete. It then points out the limitations of existing techniques and reveals that in practice, the essence for designing an appropriate data reorganization algorithm can be reduced to a tradeoff among space, time, and complexity. Based on that insight, it develops two new data reorganization algorithms to overcome the limitations of previous methods. Experiments show that an assembly composed of the new algorithms and a previous algorithm can circumvent the inherent complexity in finding optimal data layouts, making it feasible to minimize non-coalesced memory accesses for a variety of irregular applications and settings that are beyond the reach of existing techniques. Categories and Subject Descriptors D.3.4 [Programming Languages]: Processors-optimization, compilers General Terms Performance, Experimentation

机译：图形处理单元（GPU）的性能对不规则的内存引用很敏感。最近的一些工作表明了数据重组的希望，以消除由不规则引用引起的非协商内存访问。但是，所有以前的研究都采用简单的启发式方法来确定要创建的新数据布局。结果，它们要么不提供任何性能保证，要么仅对某些有限的情况有效。本文对该问题做出了基础研究。它系统地分析了各种情况下问题的内在复杂性，并首次证明了该问题是NP完全的。然后指出了现有技术的局限性，并揭示了在实践中，设计适当的数据重组算法的本质可以简化为空间，时间和复杂性之间的权衡。基于这一见解，它开发了两种新的数据重组算法来克服以前方法的局限性。实验表明，由新算法和先前算法组成的程序集可以规避固有的复杂性，以寻找最佳的数据布局，从而使针对非常规应用程序和设置的非高级内存访问最小化是可行的，这超出了现有技术的范围技术。类别和主题描述符D.3.4 [编程语言]：处理器优化，编译器通用术语性能，实验

著录项

来源
《ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages 》 |2013年第8期| 共12页
作者
Bo Wu; Zhijia Zhao; Eddy Z. Zhang; Yunlian Jiang; Xipeng Shen;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类 73.96;
关键词
GPGPU; Memory coalescing; Computational complexity; Thread-data remapping; Runtime optimizations; Data transformation;

机译：GPGPU;内存合并;计算复杂度;线程数据重映射;运行时优化;数据转换;

相似文献

外文文献
中文文献
专利

1. Complexity Analysis and Algorithm Design for Reorganizing Data to Minimize Non-Coalesced Memory Accesses on GPU [J] . Bo Wu, Zhijia Zhao, Eddy Z. Zhang, ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2013 ,第8期

机译：数据重组以最大程度减少GPU上的非批量内存访问的复杂度分析和算法设计
2. GPU-DAEMON: GPU algorithm design, data management & optimization template for array based big omics data [J] . Awan Muaaz Gul, Eslami Taban, Saeed Fahad Computers in Biology and Medicine . 2018 ,第期

机译：GPU-守护程序：基于阵列的大OMIC数据的GPU算法设计，数据管理和优化模板
3. Multi-GPU parallel algorithm design and analysis for improved inversion of probability tomography with gravity gradiometry data [J] . Hou Zhenlong, Huang Danian Journal of Applied Geophysics . 2017 ,第期

机译：具有重力梯度数据的概率断层扫描改善逆变的多GPU并行算法设计与分析
4. Complexity Analysis and Algorithm Design for Reorganizing Data to Minimize Non-Coalesced Memory Accesses on GPU [C] . Bo Wu, Zhijia Zhao, Eddy Z. Zhang, ACM SIGPLAN Symposium on Priciples and Practice of Parallel Programming . 2013

机译：复杂性分析和算法设计，用于重组数据，以最大限度地减少GPU上的非聚结的内存访问
5. Advanced Concurrency Control Algorithm Design and GPU System Support for High Performance In-Memory Data Management. [D] . Yuan, Yuan. 2016

机译：用于高性能内存数据管理的高级并发控制算法设计和GPU系统支持。
6. GPU-DAEMON: GPU Algorithm Design Data Management Optimization template for array based big omics data [O] . Muaaz Gul Awan, Taban Eslami, Fahad Saeed -1

机译：GPU-DAEMON：用于基于阵列的大组学数据的GPU算法设计数据管理和优化模板
7. Complexity Analysis and Algorithm Design for Reorganizing Data to Minimize Non-Coalesced Memory Accesses on GPU ∗ [O] . Bo Wu, Zhijia Zhao, Eddy Z. Zhang, 2013

机译：用于重组数据的复杂性分析和算法设计，以最小化GpU上的非合并内存访问*

Complexity Analysis and Algorithm Design for Reorganizing Data to Minimize Non-Coalesced Memory Accesses on GPU

摘要

著录项

相似文献

相关主题

期刊订阅