首页> 外文会议>SEM International Congress Exposition on Experimental and Applied Mechanics >Fast, Sub-pixel Accurate Digital Image Correlation Algorithm Powered by Heterogeneous (CPU-GPU) Framework
【24h】

Fast, Sub-pixel Accurate Digital Image Correlation Algorithm Powered by Heterogeneous (CPU-GPU) Framework

机译:基于异构(CPU-GPU)框架的快速,子像素精确数字图像相关算法

获取原文
获取外文期刊封面目录资料

摘要

Digital Image Correlation (DIC) is a popular non-contact image-based full-field deformation measurement tool widely used in mechanics. In spite of its significant advantages, it is still primarily used as a post-processing tool due to its computational cost. In recent years, parallel computing platforms such as multi-core processors and Graphics Processing Units (GPUs) have been used to improve the speed of the DIC algorithm, with GPUs being well-suited for implementing data-parallel operations. Previous works have performed GPU-based DIC wherein each sub-image (i.e. a collection of a few pixels in the local neighborhood of a point of interest) is allocated to a single thread on the GPU, thus achieving parallelism across sub-images. However, this is not the only type of parallelism that is possible: one can also achieve parallelism within a sub-image as well as across whole images. The aim of this work is to efficiently implement 2D-DIC such that parallelism within a sub-image as well as across sub-images leads to considerable reduction in computation time. We use a heterogeneous framework consisting of an Intel Xeon octa-core CPU and an Nvidia Tesla K20C GPU card in this work. The CPU is used to handle image pre-processing, whereas the GPU is used to process four compute-intensive tasks: affine shape function computation, B-Spline interpolation, residual vector calculation and deformation vector update. Parallelization within and across sub-images is achieved in this work by efficient thread handling and use of pre-compiled BLAS libraries. In order to estimate the speedup provided by the GPU, the same four tasks were also evaluated on the octa-core CPU; a speedup of approximately 7 to 5 times was observed for a single sub-image whose size varies from 21×21 to 61×61 respectively. However, it is expected that for a larger number of sub-images, the GPU speedup will be higher and this is indeed the case: when the affine shape function computation and B-Spline interpolation steps were evaluated on 1869 21×21 pixel sub-images, the speedup was around a more impressive 453 times. Further GPU optimization as well as parallelization across image pairs is currently underway and even faster GPU-assisted DIC seems achievable.
机译:数字图像相关(DIC)是一种流行的基于非接触式图像的全场变形测量工具,广泛用于力学。尽管有显着的优势,但由于其计算成本,它仍然主要用作后处理工具。近年来,已使用平行计算平台,例如多核处理器和图形处理单元(GPU)来提高DIC算法的速度,并且GPU非常适合实现数据并行操作。以前的作品已经执行了基于GPU的DIC,其中每个子图像(即兴趣点的本地附近的局部附近的几个像素的集合)被分配给GPU上的单个线程,从而在跨子图像上实现并行性。但是,这不是可能的唯一行为的行为:一个人也可以在子图像中实现并行性以及整个图像。本作作品的目的是有效地实现2D-DIC,使得子图像内的并行性以及跨子图像导致计算时间的相当大降低。我们使用由英特尔Xeon Octa核心CPU和NVIDIA Tesla K20C GPU卡组成的异构框架。 CPU用于处理图像预处理,而GPU用于处理四个计算密集型任务:仿射形状函数计算,B样条插值,残差矢量计算和变形矢量更新。通过高效的线程处理和使用预编译的BLAS库,在这项工作中实现并行化在这项工作中实现。为了估计GPU提供的加速,还在Octa-Cod CPU上评估了相同的四个任务;对于单个子图像观察到大约7〜5次的加速,其大小分别从21×21到61×61变化。但是,对于更大数量的子图像,GPU加速将更高,这确实如此:当在1869 21×21像素子区域评估仿射形状函数计算和B样条内插步骤时图像,加速是令人印象深刻的453次。进一步的GPU优化以及图像对的并行化目前正在进行中,甚至更快的GPU辅助DIC似乎可实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号