...
首页> 外文期刊>Journal of Real-Time Image Processing >A framework for accelerating local feature extraction with OpenCL on multi-core CPUs and co-processors
【24h】

A framework for accelerating local feature extraction with OpenCL on multi-core CPUs and co-processors

机译:在多核CPU和协处理器上使用OpenCL加速局部特征提取的框架

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this paper, we examined heterogeneous architectures, for their suitability to run the scale invariant feature transformation (SIFT) algorithm in real time. The SIFT is one of the most robust as well as one of the most computational intensive algorithms to extract local features in many machine-vision applications. Many ongoing researches presented methods on improving the SIFT execution time. However, described techniques focus only on improving the SIFT execution time on a single homogeneous device. To address the gap in improving SIFT algorithm execution time on multi-device heterogeneous platforms we have prepared the OpenCL-SIFT implementation. We have described techniques to efficiently parallelize the application that contains many different computing cores. By a careful optimization process, we presented the performance portable implementation, for an efficient processing on various multi-device heterogeneous platforms. The experimental results showed that our implementation obtains appropriate accuracy and higher efficiency compared to recent open-source SIFT implementations. Using proposed methods we extracted SIFT features with more than 30 FPS on Full-HD images with different processor architectures. Additionally to increase the performance, we showed efficient (in average speed-up of 2.69x) multi-device scheduling methods for SIFT feature extraction. Finally, we described guidelines to optimize GPGPU-OpenCL programs for x86 multi-core CPUs. The discussed methods are generic and may be used for the design of other algorithms.
机译:在本文中,我们研究了异构体系结构是否适合实时运行尺度不变特征变换(SIFT)算法。 SIFT是在许多机器视觉应用程序中提取局部特征的最强大的算法之一,也是计算量最大的算法之一。许多正在进行的研究提出了改善SIFT执行时间的方法。但是,所描述的技术仅专注于改善单个同类设备上的SIFT执行时间。为了解决在多设备异构平台上缩短SIFT算法执行时间的差距,我们准备了OpenCL-SIFT实现。我们已经描述了有效并行化包含许多不同计算核心的应用程序的技术。通过精心的优化过程,我们提出了性能可移植的实现,以在各种多设备异构平台上进行有效处理。实验结果表明,与最近的开源SIFT实现相比,我们的实现获得了适当的准确性和更高的效率。使用提出的方法,我们在具有不同处理器体系结构的全高清图像上提取了具有30 FPS以上的SIFT功能。为了提高性能,我们展示了用于SIFT特征提取的高效(平均速度为2.69倍)多设备调度方法。最后,我们介绍了针对x86多核CPU优化GPGPU-OpenCL程序的准则。讨论的方法是通用的,可用于其他算法的设计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号