首页> 外文会议>Embedded computer systems: Architectures, modeling, and simulation >Efficient Mapping of Multiresolution Image Filtering Algorithms on Graphics Processors
【24h】

Efficient Mapping of Multiresolution Image Filtering Algorithms on Graphics Processors

机译:图形处理器上多分辨率图像过滤算法的高效映射

获取原文
获取原文并翻译 | 示例

摘要

In the last decade, there has been a dramatic growth in research and development of massively parallel commodity graphics hardware both in acade-mia and industry. Graphics card architectures provide an optimal platform for parallel execution of many number crunching loop programs from fields like image processing, linear algebra, etc. However, it is hard to efficiently map such algorithms to the graphics hardware even with detailed insight into the architecture. This paper presents a multiresolution image processing algorithm and shows the efficient mapping of this type of algorithms to the graphics hardware. Furthermore, the impact of execution configuration is illustrated and a method is proposed to determine the best configuration offline in order to use it at run-time. Using CUDA as programming model, it is demonstrated that the image processing algorithm is significantly accelerated and that a speedup of up to 33x can be achieved on NVIDIA's Tesla C870 compared to a parallelized implementation on a Xeon Quad Core.
机译:在过去的十年中,学术界和工业界对大规模并行商品图形硬件的研究与开发有了巨大的增长。图形卡体系结构为并行执行来自图像处理,线性代数等领域的许多数字紧缩循环程序提供了一个最佳平台。但是,即使对体系结构有详细的了解,也很难有效地将此类算法映射到图形硬件。本文提出了一种多分辨率图像处理算法,并显示了这种算法到图形硬件的有效映射。此外,说明了执行配置的影响,并提出了一种离线确定最佳配置以便在运行时使用它的方法。使用CUDA作为编程模型,证明了图像处理算法得到了显着加速,与在Xeon四核上的并行实现相比,在NVIDIA的Tesla C870上可以实现高达33倍的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号