首页> 外文期刊>Microelectronics & Reliability >Evaluating the soft error sensitivity of a GPU-based SoC for matrix multiplication
【24h】

Evaluating the soft error sensitivity of a GPU-based SoC for matrix multiplication

机译:评估基于GPU的SOC的软错误敏感性矩阵乘法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

System-on-Chip (SoC) devices can be composed of low-power multicore processors combined with a small graphics accelerator (or GPU) which offers a trade-off between computational capacity and low-power consumption. In this work we use the LLFI-GPU fault injection tool on one of these devices to compare the sensitivity to soft errors of two different CUDA versions of matrix multiplication benchmark. Specifically, we perform fault injection campaigns on a Jetson TK1 development kit, a board equipped with a SoC including an NVIDIA "Kepler" Graphics Processing Unit (GPU). We evaluate the effect of modifying the size of the problem and also the thread-block size on the behaviour of the algorithms. Our results show that the block version of the matrix multiplication benchmark that leverages the shared memory of the GPU is not only faster than the element-wise version, but it is also much more resilient to soft errors. We also use the cuda-gdb debugger to analyze the main causes of the crashes in the code due to soft errors. Our experiments show that most of the errors are due to accesses to invalid positions of the different memories of the GPU, which causes that the block version suffers a higher percentage of this kind of errors.
机译:系统上的系统(SOC)设备可以由低功耗多核处理器组成,与小图形加速器(或GPU)相结合,在计算能力和低功耗之间提供权衡。在这项工作中,我们将LLFI-GPU故障注入工具在其中一个设备上比较了两种不同CUDA版本的矩阵乘法基准的敏感性。具体而言,我们在Jetson TK1开发套件上执行故障注入活动,该板配备有SOC,包括NVIDIA“开普勒”图形处理单元(GPU)。我们评估修改问题大小以及算法行为的线程块大小的效果。我们的结果表明,利用GPU的共享内存的矩阵乘法基准的块版本不仅比元素-IND vise更快,而且对软错误也更具弹性。我们还使用CUDA-GDB调试器来分析由于软错误而在代码中崩溃的主要原因。我们的实验表明,大多数错误都是由于访问GPU的不同存储器的无效位置,这导致块版本遭受了更高的这种错误百分比。

著录项

  • 来源
    《Microelectronics & Reliability》 |2020年第11期|113856.1-113856.5|共5页
  • 作者单位

    Univ Jaume I Castello Dept Ingn & Ciencia Computadores Castellon De La Plana Spain;

    Univ Jaume I Castello Dept Ingn & Ciencia Computadores Castellon De La Plana Spain;

    Univ Carlos III Madrid Dept Tecnol Elect Madrid Spain;

    Univ Carlos III Madrid Dept Tecnol Elect Madrid Spain;

    Univ Carlos III Madrid Dept Tecnol Elect Madrid Spain;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    GPU; Soft errors; Sensitivity; Fault injection;

    机译:GPU;软错误;敏感性;故障注射;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号