An enhanced GPU reduction at the warp-level

Hou Neng; He Fazhi; Zhou Yi

首页> 中文期刊> 《计算机辅助绘图设计与制造（英文版）》 >An enhanced GPU reduction at the warp-level

An enhanced GPU reduction at the warp-level

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

In recent years, graphical processing unit(GPU)-accelerated intelligent algorithms have been widely utilized for solving combination optimization problems, which are NP-hard. These intelligent algorithms involves a common operation, namely reduction, in which the best suitable candidate solution in the neighborhood is selected. As one of the main procedures, it is necessary to optimize the reduction on the GPU. In this paper, we propose an enhanced warp-based reduction on the GPU. Compared with existing block-based reduction methods, our method exploit efficiently the potential of implementation at warp level, which better matches the characteristics of current GPU architecture. Firstly, in order to improve the global memory access performance, the vectoring accessing is utilized. Secondly, at the level of thread block reduction, an enhanced warp-based reduction on the shared memory are presented to form partial results. Thirdly, for the configuration of the number of thread blocks, the number of thread blocks can be obtained by maximizing the size of thread block and the maximum size of threads per stream multi-processor on GPU. Finally, the proposed method is evaluated on three generations of NVIDIA GPUs with the better performances than previous methods.

著录项

来源
《计算机辅助绘图设计与制造（英文版）》 |2016年第2期|43-52|共10页
作者
Hou Neng; He Fazhi; Zhou Yi;
展开▼
作者单位

School of Computer Science and Technology, Wuhan University, Wuhan 430072, China;

School of Computer Science and Technology, Wuhan University, Wuhan 430072, China;

School of Computer Science and Technology, Wuhan University, Wuhan 430072, China;

展开▼
原文格式 PDF
正文语种 eng
中图分类
关键词

An enhanced GPU reduction at the warp-level

摘要

著录项

相关主题

期刊订阅