首页> 外文会议>IEEE Intelligent Vehicles Symposium >Incremental Cross-Modality deep learning for pedestrian recognition
【24h】

Incremental Cross-Modality deep learning for pedestrian recognition

机译:增量式跨模态深度学习用于行人识别

获取原文

摘要

In spite of the large number of existing methods, pedestrian detection remains an open challenge. In recent years, deep learning classification methods combined with multi-modality images within different fusion schemes have achieved the best performance. It was proven that the late-fusion scheme outperforms both direct and intermediate integration of modalities for pedestrian recognition. Hence, in this paper, we focus on improving the late-fusion scheme for pedestrian classification on the Daimler stereo vision data set. Each image modality, Intensity, Depth and Flow, is classified by an independent Convolutional Neural Network (CNN), the outputs of which are then fused by a Multi-layer Perceptron (MLP) before the recognition decision. We propose different methods based on Cross-Modality deep learning of CNNs: (1) a correlated model where a unique CNN is trained with Intensity, Depth and Flow images for each frame, (2) an incremental model where a CNN is trained with the first modality images frames, then a second CNN, initialized by transfer learning on the first one is trained on the second modality images frames, and finally a third CNN initialized on the second one, is trained on the last modality images frames. The experiments show that the incremental cross-modality deep learning of CNNs improves classification performances not only for each independent modality classifier, but also for the multi-modality classifier based on late-fusion. Different learning algorithms are also investigated.
机译:尽管存在大量现有方法,但是行人检测仍然是一个开放的挑战。近年来,在不同融合方案中结合多模式图像的深度学习分类方法取得了最佳性能。事实证明,后期融合方案优于行人识别方式的直接和中间集成。因此,在本文中,我们着重于改进戴姆勒立体视觉数据集上的行人分类的后期融合方案。每个图像模态(强度,深度和流量)由独立的卷积神经网络(CNN)进行分类,然后在识别决定之前由多层感知器(MLP)融合其输出。我们基于CNN的跨模态深度学习提出了不同的方法:(1)一个相关模型,其中使用每个帧的强度,深度和流图像训练唯一的CNN,(2)一个增量模型,其中使用CNN训练CNN首先在第二模态图像帧上训练通过在第一个模态图像帧上的转移学习初始化的第二CNN,最后在第二模态图像帧上训练在第二模态图像帧上初始化的第三CNN,最后在第二模态图像帧上训练第三CNN。实验表明,CNN的增量式跨模态深度学习不仅提高了每个独立模态分类器的分类性能,而且还改善了基于后期融合的多模态分类器的分类性能。还研究了不同的学习算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号