Incremental Cross-Modality deep learning for pedestrian recognition

机译：增量式跨模态深度学习用于行人识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In spite of the large number of existing methods, pedestrian detection remains an open challenge. In recent years, deep learning classification methods combined with multi-modality images within different fusion schemes have achieved the best performance. It was proven that the late-fusion scheme outperforms both direct and intermediate integration of modalities for pedestrian recognition. Hence, in this paper, we focus on improving the late-fusion scheme for pedestrian classification on the Daimler stereo vision data set. Each image modality, Intensity, Depth and Flow, is classified by an independent Convolutional Neural Network (CNN), the outputs of which are then fused by a Multi-layer Perceptron (MLP) before the recognition decision. We propose different methods based on Cross-Modality deep learning of CNNs: (1) a correlated model where a unique CNN is trained with Intensity, Depth and Flow images for each frame, (2) an incremental model where a CNN is trained with the first modality images frames, then a second CNN, initialized by transfer learning on the first one is trained on the second modality images frames, and finally a third CNN initialized on the second one, is trained on the last modality images frames. The experiments show that the incremental cross-modality deep learning of CNNs improves classification performances not only for each independent modality classifier, but also for the multi-modality classifier based on late-fusion. Different learning algorithms are also investigated.

机译：尽管存在大量现有方法，但是行人检测仍然是一个开放的挑战。近年来，在不同融合方案中结合多模式图像的深度学习分类方法取得了最佳性能。事实证明，后期融合方案优于行人识别方式的直接和中间集成。因此，在本文中，我们着重于改进戴姆勒立体视觉数据集上的行人分类的后期融合方案。每个图像模态（强度，深度和流量）由独立的卷积神经网络（CNN）进行分类，然后在识别决定之前由多层感知器（MLP）融合其输出。我们基于CNN的跨模态深度学习提出了不同的方法：（1）一个相关模型，其中使用每个帧的强度，深度和流图像训练唯一的CNN，（2）一个增量模型，其中使用CNN训练CNN首先在第二模态图像帧上训练通过在第一个模态图像帧上的转移学习初始化的第二CNN，最后在第二模态图像帧上训练在第二模态图像帧上初始化的第三CNN，最后在第二模态图像帧上训练第三CNN。实验表明，CNN的增量式跨模态深度学习不仅提高了每个独立模态分类器的分类性能，而且还改善了基于后期融合的多模态分类器的分类性能。还研究了不同的学习算法。

著录项

来源
《IEEE Intelligent Vehicles Symposium》|2017年|523-528|共6页
会议地点
作者
Dănuţ Ovidiu Pop; Alexandrina Rogozan; Fawzi Nashashibi; Abdelaziz Bensrhair;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Training; Computer vision; Image motion analysis; Machine learning; Feature extraction; Stereo vision; Data models;

机译：训练;计算机视觉;图像运动分析;机器学习;特征提取;立体视觉;数据模型;

相似文献

外文文献
中文文献
专利

1. Pedestrian Recognition Using Cross-Modality Learning in Convolutional Neural Networks [J] . Pop Danut Ovidiu, Rogozan Alexandrina, Nashashibi Fawzi, Intelligent Transportation Systems Magazine, IEEE . 2021,第1期

机译：在卷积神经网络中使用跨模型学习的行人识别
2. A crosswalk pedestrian recognition system by using deep learning and zebra-crossing recognition techniques [J] . Software . 2020,第5期

机译：利用深度学习和斑马线识别技术的人行横道行人识别系统
3. Recognition of pedestrian trajectories and attributes with computer vision and deep learning techniques [J] . Peter Kok-Yiu Wong, Han Luo, Mingzhu Wang, Advanced engineering informatics . 2021,第Auga期

机译：用计算机视觉和深层学习技术认识人行道轨迹和属性
4. Incremental Cross-Modality deep learning for pedestrian recognition [C] . D?nu? Ovidiu Pop, Alexandrina Rogozan, Fawzi Nashashibi, IEEE Intelligent Vehicles Symposium . 2017

机译：行人识别的增量跨越模型深度学习
5. Pedestrian Detection Using Deep Learning Through A Dashcam [D] . Trivedi, Harshil Pareshkumar. 2019

机译：使用Dashcam深入学习的行人检测
6. Bright-field holography: cross-modality deep learning enables snapshot 3D imaging with bright-field contrast using a single hologram [O] . Yichen Wu, Yilin Luo, Gunvant Chaudhari, 2019

机译：明场全息图：跨模态深度学习可使用单个全息图实现具有明场对比度的快照3D成像
7. Incremental Cross-Modality Deep Learning for Pedestrian Recognition [O] . Pop, Danut Ovidiu, Rogozan, Alexandrina, Nashashibi, Fawzi, 2017

机译：行人识别的增量式跨模式深度学习

Incremental Cross-Modality deep learning for pedestrian recognition

摘要

著录项

相似文献

相关主题

期刊订阅