首页> 外文OA文献 >Real-time GPU-based software beamformer designed for advanced imagingmethods research
【2h】

Real-time GPU-based software beamformer designed for advanced imagingmethods research

机译:基于GpU的实时软件波束形成器,专为高级成像方法研究而设计

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

High computational demand is known to be a technical hurdle for real-timeimplementation of advanced methods like synthetic aperture imaging (SAI) andplane wave imaging (PWI) that work with the pre-beamform data of each arrayelement. In this paper, we present the development of a software beamformer forSAI and PWI with real-time parallel processing capacity. Our beamformer designcomprises a pipelined group of graphics processing units (GPU) that are hostedwithin the same computer workstation. During operation, each available GPU isassigned to perform demodulation and beamforming for one frame of pre-beamformdata acquired from one transmit firing (e.g. point firing for SAI). Tofacilitate parallel computation, the GPUs have been programmed to treat thecalculation of depth pixels from the same image scanline as a block ofprocessing threads that can be executed concurrently, and it would repeat thisprocess for all scanlines to obtain the entire frame of image data i.e.low-resolution image (LRI). To reduce processing latency due to repeated accessof each GPU's global memory, we have made use of each thread block's fast-sharedmemory (to store an entire line of pre-beamform data during demodulation),created texture memory pointers, and utilized global memory caches (to streamrepeatedly used data samples during beamforming). Based on this beamformerarchitecture, a prototype platform has been implemented for SAI and PWI, and itsLRI processing throughput has been measured for test datasets with 40 MHzsampling rate, 32 receive channels, and imaging depths between 5-15 cm. Whenusing two Fermi-class GPUs (GTX-470), our beamformer can compute LRIs of512-by-255 pixels at over 3200 fps and 1300 fps respectively for imaging depthsof 5 cm and 15 cm. This processing throughput is roughly 3.2 times higher than aTesla-class GPU (GTX-275). © 2010 IEEE.
机译:众所周知,高计算需求是实时实现先进方法(例如合成孔径成像(SAI)和平面波成像(PWI))的技术障碍,该方法可与每个阵列元素的波束前数据一起工作。在本文中,我们介绍了具有实时并行处理能力的SAI和PWI软件波束形成器的开发。我们的Beamformer设计包含在同一台计算机工作站中托管的一组流水线图形处理单元(GPU)。在操作期间,分配每个可用的GPU,以对从一个发射触发(例如,SAI的点触发)获取的一帧预波束形数据执行解调和波束成形。为了便于并行计算,已对GPU进行了编程,以将来自同一图像扫描线的深度像素的计算作为可并行执行的处理线程块进行处理,并且它将对所有扫描线重复此过程以获得图像数据的整个帧,即分辨率图像(LRI)。为了减少由于重复访问每个GPU的全局内存而导致的处理延迟,我们利用了每个线程块的快速共享内存(在解调过程中存储了整行pre-beamform数据),创建了纹理内存指针,并利用了全局内存缓存(在波束成形过程中重复使用重复的数据样本)。基于此波束成形架构,已为SAI和PWI实现了原型平台,并针对40 MHz采样率,32个接收通道以及5至15 cm成像深度的测试数据集测量了其LRI处理吞吐量。当使用两个费米级GPU(GTX-470)时,我们的波束形成器可以分别在3200 fps和1300 fps以上的情况下针对5 cm和15 cm的成像深度计算512 x 255像素的LRI。该处理吞吐量大约是Tesla级GPU(GTX-275)的3.2倍。 ©2010 IEEE。

著录项

  • 作者

    Yu ACH; Yiu BYS; Tsang IKH;

  • 作者单位
  • 年度 2010
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号