
MMSS: Multi-modal Sharable and Specific Feature Learning for RGB-D Object Recognition



Abstract

Most of the feature-learning methods for RGB-D object recognition either learn features from color and depth modalities separately, or simply treat RGB-D as undifferentiated four-channel data, which cannot adequately exploit the relationship between different modalities. Motivated by the intuition that different modalities should contain not only some modal-specific patterns but also some shared common patterns, we propose a multi-modal feature learning framework for RGB-D object recognition. We first construct deep CNN layers for color and depth separately, and then connect them with our carefully designed multi-modal layers, which fuse color and depth information by enforcing a common part to be shared by features of different modalities. In this way, we obtain features reflecting shared properties as well as modal-specific properties in different modalities. The information of the multi-modal learning frameworks is back-propagated to the early CNN layers. Experimental results show that our proposed multi-modal feature learning method outperforms state-of-the-art approaches on two widely used RGB-D object benchmark datasets.
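The abstract describes a two-stream design: separate CNN encoders for color and depth whose features are split into a part shared across modalities and a modality-specific part, with the sharing constraint back-propagated into the early CNN layers. The PyTorch sketch below illustrates this idea under stated assumptions; the layer sizes, the feature split, and the L2 sharing penalty are illustrative stand-ins, not the authors' exact architecture or objective.

```python
# A minimal sketch of the two-stream shared/specific idea from the abstract.
# All dimensions and the sharing penalty are assumptions for illustration.
import torch
import torch.nn as nn
import torch.nn.functional as F

class StreamCNN(nn.Module):
    """A small convolutional encoder for one modality (RGB or depth)."""
    def __init__(self, in_channels):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 32, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(32, 64, 5, stride=2, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # global pooling -> (B, 64, 1, 1)
        )

    def forward(self, x):
        return self.features(x).flatten(1)  # (B, 64)

class MMSSNet(nn.Module):
    """Two CNN streams joined by a multi-modal layer that splits each
    modality's feature into a shared part and a specific part."""
    def __init__(self, num_classes, feat_dim=64, part_dim=32):
        super().__init__()
        self.rgb_stream = StreamCNN(in_channels=3)    # color input
        self.depth_stream = StreamCNN(in_channels=1)  # depth input
        # Project each stream to 2 * part_dim units: the first part_dim are
        # meant to be shared across modalities, the rest modality-specific.
        self.rgb_proj = nn.Linear(feat_dim, 2 * part_dim)
        self.depth_proj = nn.Linear(feat_dim, 2 * part_dim)
        self.classifier = nn.Linear(4 * part_dim, num_classes)
        self.part_dim = part_dim

    def forward(self, rgb, depth):
        fr = self.rgb_proj(self.rgb_stream(rgb))
        fd = self.depth_proj(self.depth_stream(depth))
        shared_r, spec_r = fr[:, :self.part_dim], fr[:, self.part_dim:]
        shared_d, spec_d = fd[:, :self.part_dim], fd[:, self.part_dim:]
        # Classify from the concatenated shared and specific parts.
        logits = self.classifier(
            torch.cat([shared_r, spec_r, shared_d, spec_d], dim=1))
        # Encourage the shared parts of the two modalities to agree; this
        # L2 penalty is a stand-in for the paper's sharing constraint.
        share_loss = F.mse_loss(shared_r, shared_d)
        return logits, share_loss
```

In training, the total objective would combine the classification loss with the sharing penalty, e.g. `F.cross_entropy(logits, labels) + lam * share_loss`, so that gradients from the multi-modal layers flow back into the early convolutional layers of both streams, as the abstract describes.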
