首页> 外文会议>International conference on biomimetic and biohybrid systems >Foveated Image Processing for Faster Object Detection and Recognition in Embedded Systems Using Deep Convolutional Neural Networks
【24h】

Foveated Image Processing for Faster Object Detection and Recognition in Embedded Systems Using Deep Convolutional Neural Networks

机译:利用深度卷积神经网络的嵌入式图像处理,可加快嵌入式系统中的物体检测和识别速度

获取原文

摘要

Object detection and recognition algorithms using deep convolutional neural networks (CNNs) tend to be computationally intensive to implement. This presents a particular challenge for embedded systems, such as mobile robots, where the computational resources tend to be far less than for workstations. As an alternative to standard, uniformly sampled images, we propose the use of foveated image sampling here to reduce the size of images, which are faster to process in a CNN due to the reduced number of convolution operations. We evaluate object detection and recognition on the Microsoft COCO database, using foveated image sampling at different image sizes, ranging from 416×416 to 96×96 pixels, on an embedded GPU - an NVIDIA Jetson TX2 with 256 CUDA cores. The results show that it is possible to achieve a 4 × speed-up in frame rates, from 3.59 FPS to 15.24 FPS, using 416 × 416 and 128 × 128 pixel images respectively. For foveated sampling, this image size reduction led to just a small decrease in recall performance in the foveal region, to 92.0% of the baseline performance with full-sized images, compared to a significant decrease to 50.1% of baseline recall performance in uniformly sampled images, demonstrating the advantage of foveated sampling.
机译:使用深度卷积神经网络(CNN)的对象检测和识别算法往往需要大量的计算才能实现。这对于嵌入式系统(例如移动机器人)提出了特别的挑战,在嵌入式系统中,计算资源往往比工作站要少得多。作为标准的均匀采样图像的替代方法,我们建议在此处使用偏心图像采样以减小图像尺寸,由于减少了卷积操作次数,因此可以在CNN中更快地进行处理。我们使用嵌入式GPU(具有256个CUDA内核的NVIDIA Jetson TX2)在不同的图像尺寸(从416×416到96×96像素)上使用集中的图像采样,在Microsoft COCO数据库上评估对象检测和识别。结果表明,分别使用416×416和128×128像素的图像,可以从3.59 FPS到15.24 FPS实现4倍的帧速率加速。对于集中采样,这种图像尺寸的减小导致中央凹区域的召回性能仅略有下降,与全尺寸图像相比,降低了基线性能的92.0%,而在统一采样中,显着降低了基线召回性能的50.1%图像,证明了集中采样的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号