Journal: IEEE Transactions on Multimedia

Large-Margin Multi-Modal Deep Learning for RGB-D Object Recognition


Abstract

Most existing feature-learning methods for RGB-D object recognition either combine RGB and depth data in an undifferentiated manner from the outset, or learn features from color and depth separately; neither strategy adequately exploits the distinct characteristics of the two modalities or the shared relationship between them. In this paper, we propose a general CNN-based multi-modal learning framework for RGB-D object recognition. We first construct deep CNN layers for color and depth separately, and then connect them with a carefully designed multi-modal layer. This layer is designed not only to discover the most discriminative features for each modality, but also to harness the complementary relationship between the two modalities. The results of the multi-modal layer are back-propagated to update the parameters of the CNN layers, and the multi-modal feature learning and back-propagation are performed iteratively until convergence. Experimental results on two widely used RGB-D object datasets show that our general multi-modal learning method achieves performance comparable to state-of-the-art methods specifically designed for RGB-D data.
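The architecture the abstract describes — two modality-specific CNN towers joined by a multi-modal fusion layer trained with a large-margin objective — can be sketched as follows. This is a minimal NumPy illustration, not the paper's implementation: the feature dimensions, the ReLU fusion projection, and the multi-class hinge loss are all illustrative assumptions standing in for the CNN towers and the paper's actual objective.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions (not taken from the paper): per-modality
# feature size, fused feature size, and number of object classes.
D_RGB, D_DEPTH, D_FUSED, N_CLASSES = 64, 64, 32, 10

# Stand-ins for the outputs of the two modality-specific CNN towers.
rgb_feat = rng.standard_normal(D_RGB)
depth_feat = rng.standard_normal(D_DEPTH)

# Multi-modal layer: concatenate both modalities, then apply a joint
# projection so the fused representation can capture cross-modal structure.
W_fuse = rng.standard_normal((D_FUSED, D_RGB + D_DEPTH)) * 0.01
fused = np.maximum(0.0, W_fuse @ np.concatenate([rgb_feat, depth_feat]))  # ReLU

# Linear classifier on the fused feature.
W_cls = rng.standard_normal((N_CLASSES, D_FUSED)) * 0.01
scores = W_cls @ fused

# Large-margin (multi-class hinge) loss on the class scores; the exact
# large-margin objective used in the paper may differ.
y = 3  # ground-truth class index (made up for the sketch)
margins = np.maximum(0.0, scores - scores[y] + 1.0)
margins[y] = 0.0
loss = margins.sum()
```

In training, the gradient of `loss` would flow back through `W_cls` and `W_fuse` into both CNN towers, which matches the paper's description of iterating multi-modal feature learning and back-propagation until convergence.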
