首页> 外文会议>International Conference on Digital Image Computing: Techniques and Applications >Object Classification using Deep Learning on Extremely Low-Resolution Time-of-Flight Data
【24h】

Object Classification using Deep Learning on Extremely Low-Resolution Time-of-Flight Data

机译:对象分类使用深度学习极低学习的飞行时间 - 飞行时间数据

获取原文

摘要

This paper proposes two novel deep learning models for 2D and 3D classification of objects in extremely low-resolution time-of-flight imagery. The models have been developed to suit contemporary range imaging hardware based on a recently fabricated Single Photon Avalanche Diode (SPAD) camera with 64 χ 64 pixel resolution. Being the first prototype of its kind, only a small data set has been collected so far which makes it challenging for training models. To bypass this hurdle, transfer learning is applied to the widely used VGG-16 convolutional neural network (CNN), with supplementary layers added specifically to handle SPAD data. This classifier and the renowned Faster-RCNN detector offer benchmark models for comparison to a newly created 3D CNN operating on time-of-flight data acquired by the SPAD sensor. Another contribution of this work is the proposed shot noise removal algorithm which is particularly useful to mitigate the camera sensitivity in situations of excessive lighting. Models have been tested in both low-light indoor settings and outdoor daytime conditions, on eight objects exhibiting small physical dimensions, low reflectivity, featureless structures and located at ranges from 25m to 700m. Despite antagonist factors, the proposed 2D model has achieved 95% average precision and recall, with higher accuracy for the 3D model.
机译:本文提出了两种新颖的2D和3D分类的新型深度学习模型,在极低分辨率的飞行时间内图像中的2D对象分类。已经开发了该模型,以根据最近制造的单光子雪崩二极管(SPAD)相机为基于最近制造的单光子雪崩二极管(SPAD)相机,具有64÷64像素分辨率。作为其同类的第一个原型,迄今为止只收集了一个小型数据集,这使得它挑战训练模型。为了绕过这种障碍,转移学习应用于广泛使用的VGG-16卷积神经网络(CNN),并专门添加补充层来处理SPAD数据。该分类器和着名的Faster-RCNN检测器提供基准模型,以便与在由SPAD传感器获取的飞行时间数据上运行的新创建的3D CNN比较。这项工作的另一个贡献是所提出的射击噪声去除算法,其特别有用的是在过度照明的情况下减轻相机敏感性。模型已经在低光室内设置和室外日间条件下进行了测试,八个物体,呈现出小的物理尺寸,低反射率,无形象结构,位于25米至700米的范围内。尽管具有拮抗因素,所提出的2D模型已经实现了95%的平均精度和召回,具有更高的3D模型的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号