首页> 外文会议>Symposium on Application Accelerators in High Performance Computing >A Multi-Node GPGPU Implementation of Non-Linear Anisotropic Diffusion Filter
【24h】

A Multi-Node GPGPU Implementation of Non-Linear Anisotropic Diffusion Filter

机译:非线性各向异性扩散滤波器的多节点GPGPU实现

获取原文

摘要

The quality of an image is highly critical for applications such as robotic vision, surveillance, medical imaging, etc. The images captured in real-time are seldom noise free and therefore require noise removal for further processing. Out of several proposed noise removal schemes, an isotropic diffusion filtering is known to achieve highly precise results. However, the accuracy comes at an expense of high computation cost, especially for large data sets. The highly parallel nature of the aforementioned filtering algorithm makes it a good candidate for the General Purpose Graphical Processing Unit (GPGPU) clusters. In this research, we present a GPGPU cluster-based implementation of the non-linear an isotropic diffusion filter. Our implementation maps the computationally intensive parts of the algorithm to the GPGPU devices while the communication and serial processing are performed by the CPU hosts. Our efficiently mapped multi-node GPGPU implementation is capable of processing images as large as 156 mega-pixels and achieves a speed-up of 29x over an equivalent MPI-only implementation. In addition, our multi-node GPGPU implementation exhibits reasonable scaling behavior that improves with the size of the images.
机译:图像的质量对于诸如机器人视觉,监视,医学成像等的应用非常重要。实时捕获的图像很少无噪声,因此需要噪声去除以进一步处理。除了几种提出的噪声去除方案中,已知各向同性扩散滤波来实现高精度的结果。然而,精度以高计算成本为代价,特别是对于大数据集。上述滤波算法的高度平行性质使其成为通用图形处理单元(GPGPU)集群的良好候选者。在本研究中,我们介绍了基于GPGPU的基于聚类的非线性扩散滤波器的实现。我们的实现将算法的计算密集型部分映射到GPGPU设备,同时CPU主机执行通信和串行处理。我们有效映射的多节点GPGPU实现能够处理大约156兆像素的图像,并通过同等的MPI实现实现29倍的速度。此外,我们的多节点GPGPU实现具有合理的缩放行为,可提高图像的大小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号