...
首页> 外文期刊>ACM Transactions on Embedded Computing Systems >GPU-Optimized Volume Ray Tracing for Massive Numbers of Rays in Radiotherapy
【24h】

GPU-Optimized Volume Ray Tracing for Massive Numbers of Rays in Radiotherapy

机译:GPU优化的体积射线追踪,可用于放射治疗中的大量射线

获取原文
获取原文并翻译 | 示例
           

摘要

Ray tracing within a uniform grid volume is a fundamental process invoked frequently by many applications, especially radiation-dose calculation methods in radiotherapy. However, the conflicting features between the GPU memory architecture and the memory-accessing patterns of volume ray tracing lead to inefficient usage of GPU memory bandwidth and waste of capability of modern GPUs. To improve the ray tracing performance on GPU, we propose a lookup-table-based ray tracing method which is specially optimized towards the GPU. memory system for processing a massive number of rays. The proposed method is based on a key observation that many of these applications normally involves a massive number of rays, but their ray tracing may not need to follow a specific execution order. Therefore, we divide the 3D space into many regions (called pyramids) and group together the rays falling into the same pyramid. For each ray group, the volume is rotated and resampled for their raytracing. This divide-and-rotate strategy allows the memory access of the ray tracing process to adopt a table-lookup approach and leads to better memory coalescing on GPU. Our proposed method was thoroughly evaluated in four volume setups with randomly-generated rays. The collapsed-cone convolution/superposition (CCCS) dose calculation method is also implemented with/without the proposed approach to verify the feasibility of our method. Compared with the direct GPU implementation of the popular 3DDDA algorithm, our method provides a speedup in the range of 1.91-2.94X for the volume settings we used. Major performance factors, including ray origins, volume size, and pyramid size, are also analyzed. The proposed technique was also found to be able to give a speedup of 1.61-2.17X over the original GPU implementation of the CCCS algorithm. Our experiment results indicate that the proposed approach is capable of offering better coalesced memory access which eventually boosts the raytracing performance on GPU. Moreover, our approach is conceptually simple and can be readily included into various applications.
机译:在均匀的网格体积内进行射线追踪是许多应用程序经常调用的基本过程,尤其是放射治疗中的辐射剂量计算方法。但是,GPU内存体系结构与体射线跟踪的内存访问模式之间的冲突特性导致GPU内存带宽的使用效率低下,并浪费了现代GPU的功能。为了提高GPU上的光线跟踪性能,我们提出了一种基于查找表的光线跟踪方法,该方法针对GPU进行了特别优化。用于处理大量光线的存储系统。所提出的方法基于一个关键的观察,即这些应用程序中的许多通常都涉及大量光线,但是它们的光线跟踪可能不需要遵循特定的执行顺序。因此,我们将3D空间划分为多个区域(称为金字塔),并将落入同一金字塔的光线分组在一起。对于每个射线组,将​​对其进行旋转并重新采样以进行射线跟踪。这种划分和旋转策略允许光线跟踪过程的内存访问采用表查找方法,并导致在GPU上更好的内存合并。我们提出的方法在随机产生的射线的四个体积设置中进行了彻底评估。折叠锥卷积/叠加(CCCS)剂量计算方法也可以采用或不采用所提出的方法来验证本方法的可行性。与流行的3DDDA算法的直接GPU实现相比,对于所使用的音量设置,我们的方法可将速度提高1.91-2.94X。还分析了主要的性能因素,包括射线起源,体积大小和金字塔尺寸。还发现,所提出的技术能够比CCCS算法的原始GPU实现提速1.61-2.17倍。我们的实验结果表明,所提出的方法能够提供更好的合并内存访问,最终提高GPU上的光线跟踪性能。而且,我们的方法在概念上很简单,可以轻松地包含在各种应用程序中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号