Journal: ACM Transactions on Architecture and Code Optimization

MAPS: Optimizing Massively Parallel Applications Using Device-Level Memory Abstraction


Abstract

GPUs play an increasingly important role in high-performance computing. While developing naive code is straightforward, optimizing massively parallel applications requires deep understanding of the underlying architecture. The developer must struggle with complex index calculations and manual memory transfers. This article classifies memory access patterns used in most parallel algorithms, based on Berkeley's Parallel "Dwarfs." It then proposes the MAPS framework, a device-level memory abstraction that facilitates memory access on GPUs, alleviating complex indexing using on-device containers and iterators. This article presents an implementation of MAPS and shows that its performance is comparable to carefully optimized implementations of real-world applications.
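The abstract's central idea is to replace manual index arithmetic with on-device containers and iterators. The sketch below illustrates that general pattern in CUDA; the `Window` type, its members, and the `meanFilter` kernel are hypothetical illustrations written for this page and are not taken from the actual MAPS API.

```cuda
// Hypothetical sketch of an iterator-style GPU memory abstraction.
// None of these types come from the MAPS library; they only
// illustrate the container/iterator idea described in the abstract.
#include <cuda_runtime.h>
#include <cstdio>

// A tiny read-only "window" over a 1D range of global memory. A real
// device-level abstraction would also handle shared-memory staging and
// boundary conditions behind the same interface.
struct Window {
    const float* data;
    int begin;
    int end;  // exclusive

    __device__ const float* cbegin() const { return data + begin; }
    __device__ const float* cend()   const { return data + end; }
};

// 1D mean filter written against the window instead of raw indices:
// the kernel body no longer repeats boundary arithmetic per access.
__global__ void meanFilter(const float* in, float* out, int n, int radius) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;

    Window w{in, max(i - radius, 0), min(i + radius + 1, n)};

    float sum = 0.0f;
    int count = 0;
    for (const float* p = w.cbegin(); p != w.cend(); ++p) {
        sum += *p;
        ++count;
    }
    out[i] = sum / count;
}

int main() {
    const int n = 1024, radius = 2;
    float *in, *out;
    cudaMallocManaged(&in, n * sizeof(float));
    cudaMallocManaged(&out, n * sizeof(float));
    for (int i = 0; i < n; ++i) in[i] = static_cast<float>(i);

    meanFilter<<<(n + 255) / 256, 256>>>(in, out, n, radius);
    cudaDeviceSynchronize();

    printf("out[0] = %f, out[512] = %f\n", out[0], out[512]);
    cudaFree(in);
    cudaFree(out);
    return 0;
}
```

In this sketch the neighborhood bounds are computed once when the window is constructed, so the loop body reads like ordinary iterator code; the paper's framework applies the same idea at the device level for the memory access patterns it classifies.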
