首页> 外文会议>2011 17th IEEE International Conference on Parallel and Distributed Systems >GPU Implementation of Orthogonal Matching Pursuit for Compressive Sensing
【24h】

GPU Implementation of Orthogonal Matching Pursuit for Compressive Sensing

机译:正交匹配追踪的GPU压缩感知实现

获取原文

摘要

Recovery algorithms play a key role in compressive sampling (CS). Currently, a popular recovery algorithm for CS is the orthogonal matching pursuit (OMP), which possesses the merits of low complexity and good recovery quality. Considering that the OMP involves massive matrix/vector operations, it is very suited to being implemented in parallel on graphics processing unit (GPU). In this paper, we first analyze the complexity of each module in the OMP and point out the bottlenecks of the OMP lie in the projection module and the least-squares module. To speedup the projection module, Fujimoto's matrix-vector multiplication algorithm is adopted. To speedup the least-squares module, the matrixinverse-update algorithm is adopted. Experimental results show that +40x speedup is achieved by our implementation of OMP on GTX480 GPU over on Intel(R) Core(TM) i7 CPU. Since the projection module occupies more than 2/3 of the total run time, we are looking for a faster matrix-vector multiplication algorithm.
机译:恢复算法在压缩采样(CS)中起着关键作用。目前,用于CS的流行恢复算法是正交匹配追踪(OMP),它具有低复杂度和良好的恢复质量的优点。考虑到OMP涉及大量的矩阵/矢量运算,因此非常适合在图形处理单元(GPU)上并行实现。在本文中,我们首先分析了OMP中每个模块的复杂性,指出了OMP的瓶颈在于投影模块和最小二乘模块。为了加快投影模块的速度,采用了Fujimoto的矩阵矢量乘法算法。为了加快最小二乘模块的速度,采用了矩阵逆更新算法。实验结果表明,通过在英特尔®酷睿™i7 CPU上在GTX480 GPU上实现OMP,可以实现+40倍的加速。由于投影模块占用了总运行时间的2/3以上,因此我们正在寻找一种更快的矩阵矢量乘法算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号