Improved GPU SIMD control flow efficiency via hybrid warp size mechanism

Abstract

High single instruction, multiple data (SIMD) efficiency and low power consumption have made graphics processing units (GPUs) an ideal platform for many complex computational applications. Programmers can create thousands of threads, which are grouped into fixed-size SIMD batches known as warps. High throughput is then achieved by executing such warps concurrently with minimal control overhead. However, when a branch instruction assigns different paths to different threads, one warp is broken into multiple warps that have to be executed serially, which reduces the efficiency advantage of SIMD. In this paper, the contemporary fixed-size warp design is abandoned in favor of a hybrid warp size (HWS) mechanism. Mixed-size warps are generated according to HWS and are scheduled and issued flexibly. The simulation results show that this mechanism yields an average speedup of 1.20 over the baseline architecture for a wide variety of general-purpose GPU applications. The paper also integrates HWS with dynamic warp formation (DWF), a well-known branch handling mechanism that improves SIMD utilization by forming new warps out of split warps in real time. The simulation results show that the combination of DWF and HWS yields an average speedup of 1.27 over the DWF-only platform, with an estimated area increase of about 1% over DWF.
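
The cost of a divergent branch at different warp sizes can be illustrated with a toy model. The sketch below is not the paper's HWS mechanism or its simulator. It assumes a single two-way branch, threads packed into warps in index order, and a warp that executes both paths serially whenever its threads disagree. The names issue_slots and simd_efficiency are hypothetical.

```python
from typing import List


def issue_slots(taken: List[bool], warp_size: int) -> int:
    """Count warp issue slots for one two-way branch (toy model, an assumption)."""
    slots = 0
    for start in range(0, len(taken), warp_size):
        warp = taken[start:start + warp_size]
        # A warp whose threads disagree runs both paths one after the other.
        diverged = any(warp) and not all(warp)
        slots += 2 if diverged else 1
    return slots


def simd_efficiency(taken: List[bool], warp_size: int) -> float:
    """Fraction of lane-slots that do useful work for this branch."""
    slots = issue_slots(taken, warp_size)
    # Each thread needs exactly one lane-slot; the rest are masked-off lanes.
    return len(taken) / (slots * warp_size)


if __name__ == "__main__":
    # Hypothetical pattern: 32 threads, the first 8 take the branch.
    taken = [i < 8 for i in range(32)]
    for size in (32, 16, 8):
        print(size, issue_slots(taken, size), round(simd_efficiency(taken, size), 3))
```

In this pattern a single 32-thread warp diverges and half of its lane-slots are masked off, while 8-thread warps never diverge. Smaller warps cost more issue slots per instruction on uniform code, which is one reason a design might mix warp sizes rather than shrink all of them.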

Bibliographic record

  • Source
    Microprocessors and Microsystems | 2014, No. 7 | pp. 717-729 | 13 pages
  • Author affiliations

    Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, S7N 5A9 SK, Canada;

    Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, S7N 5A9 SK, Canada;

    Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, S7N 5A9 SK, Canada;

  • Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA);
  • Original format: PDF
  • Text language: English
  • Chinese Library Classification
  • Keywords

    SIMD; GPU; Warp; Branch divergence;
