首页> 外文期刊>Journal of Real-Time Image Processing >Real-time multitarget tracking for sensor-based sorting: A new implementation of the auction algorithm for graphics processing units
【24h】

Real-time multitarget tracking for sensor-based sorting: A new implementation of the auction algorithm for graphics processing units

机译:实时多目标跟踪,基于传感器的排序:图形处理单元拍卖算法的新实现

获取原文
获取原文并翻译 | 示例
           

摘要

Utilizing parallel algorithms is an established way of increasing performance in systems that are bound to real-time restrictions. Sensor-based sorting is a machine vision application for which firm real-time requirements need to be respected in order to reliably remove potentially harmful entities from a material feed. Recently, employing a predictive tracking approach using multitarget tracking in order to decrease the error in the physical separation in optical sorting has been proposed. For implementations that use hard associations between measurements and tracks, a linear assignment problem has to be solved for each frame recorded by a camera. The auction algorithm can be utilized for this purpose, which also has the advantage of being well suited for parallel architectures. In this paper, an improved implementation of this algorithm for a graphics processing unit (GPU) is presented. The resulting algorithm is implemented in both an OpenCL and a CUDA based environment. By using an optimized data structure, the presented algorithm outperforms recently proposed implementations in terms of speed while retaining the quality of output of the algorithm. Furthermore, memory requirements are significantly decreased, which is important for embedded systems. Experimental results are provided for two different GPUs and six datasets. It is shown that the proposed approach is of particular interest for applications dealing with comparatively large problem sizes.
机译:利用并行算法是在受实时限制约束的系统中提高性能的既定方法。基于传感器的分拣是一种机器视觉应用程序,对于该应用程序,必须遵守严格的实时要求,以便从物料进料中可靠地除去潜在有害的实体。近来,已经提出了使用多目标跟踪的预测跟踪方法,以减少光学分类中的物理分离中的误差。对于在测量和轨迹之间使用硬关联的实现,必须为相机记录的每个帧解决线性分配问题。拍卖算法可以用于此目的,其优点还在于非常适合于并行架构。在本文中,提出了针对图形处理单元(GPU)的该算法的改进实现。生成的算法在基于OpenCL和基于CUDA的环境中实现。通过使用优化的数据结构,本文提出的算法在速度方面优于最近提出的实现,同时保留了算法输出的质量。此外,显着降低了内存需求,这对于嵌入式系统很重要。提供了两个不同的GPU和六个数据集的实验结果。结果表明,所提出的方法对于处理较大问题的应用特别有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号