Asynchronous Memory Machine Models with Barrier Synchronization

Koji NAKANO

Abstract

The Discrete Memory Machine (DMM) and the Unified Memory Machine (UMM) are theoretical parallel computing models that capture the essence of the shared memory and the global memory of GPUs. It was previously assumed that warps (i.e., groups of threads) on the DMM and the UMM work synchronously in a round-robin manner. However, warps work asynchronously on actual GPUs, in the sense that they may be dispatched for execution randomly (or arbitrarily). The first contribution of this paper is to introduce asynchronous versions of the DMM and the UMM, in which warps are arbitrarily dispatched. In place of synchronous execution, we assume that threads can execute the "syncthreads" instruction for barrier synchronization. Since the barrier synchronization operation is costly, we should evaluate and minimize the number of barrier synchronization operations performed by parallel algorithms. The second contribution of this paper is to show a parallel algorithm that computes the sum of n numbers in optimal computing time with few barrier synchronization steps.
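
The abstract does not describe the summation algorithm itself, so the following is only a minimal sketch of the general idea of summing with barrier synchronization, not the paper's method. It assumes a plain pairwise tree reduction, uses Python's `threading.Barrier` as a stand-in for the "syncthreads" instruction, and lets the operating system schedule the threads in arbitrary order. The names `barrier_sum` and `num_threads` are hypothetical.

```python
import threading


def barrier_sum(values, num_threads=4):
    """Sum `values` with a pairwise tree reduction.

    Returns (total, barrier_steps). Illustrative sketch only: this is a
    generic tree reduction, not the algorithm from the paper.
    """
    data = list(values)
    n = len(data)
    if n == 0:
        return 0, 0

    # Stand-in for "syncthreads": every thread must arrive before any proceeds.
    barrier = threading.Barrier(num_threads)

    def worker(tid):
        stride = 1
        while stride < n:
            # In this round, slot i (a multiple of 2*stride) absorbs slot
            # i + stride. Written and read slots never coincide in a round,
            # so the threads may run in any order between two barriers.
            for i in range(tid * 2 * stride, n, num_threads * 2 * stride):
                if i + stride < n:
                    data[i] += data[i + stride]
            barrier.wait()  # one barrier synchronization step per round
            stride *= 2

    threads = [threading.Thread(target=worker, args=(t,))
               for t in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

    # Every thread runs the same number of rounds; count them once here.
    steps, stride = 0, 1
    while stride < n:
        steps += 1
        stride *= 2
    return data[0], steps


if __name__ == "__main__":
    total, steps = barrier_sum(range(1, 101), num_threads=4)
    print(total, steps)
```

This sketch spends one barrier step per halving round, so the step count grows with the logarithm of n. Counting barrier steps in this way is the kind of cost measure the abstract argues should be evaluated and minimized.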

Bibliographic record

Source: 《電子情報通信学会技術研究報告. ディペンダブルコンピュ-ティング. Dependable Computing》, 2012, No. 482, pp. 247-252 (6 pages)
Author: Koji NAKANO
Affiliation: School of Engineering, Hiroshima University
Original format: PDF
Language: English
Chinese Library Classification: Communication secrecy and communication security
Keywords: Parallel computing models; Parallel algorithms; Asynchronous models; GPU; CUDA
