European Conference on Computer Vision

Multi-modal Unsupervised Feature Learning for RGB-D Scene Labeling



Abstract

Most existing approaches to RGB-D indoor scene labeling employ hand-crafted features for each modality independently and combine them heuristically. There have been attempts to learn features directly from raw RGB-D data, but their performance has not been satisfactory. In this paper, we cast RGB-D labeling as a multi-modality learning problem and adapt unsupervised feature learning to it. Our learning framework performs feature learning and feature encoding simultaneously, which significantly boosts performance. By stacking the basic learning structure, higher-level features are derived and combined with lower-level features to better represent RGB-D data. Experimental results on the benchmark NYU depth dataset show that our method achieves competitive performance compared with the state of the art.
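The abstract does not give implementation details, so as a rough illustration of the general "unsupervised dictionary learning plus feature encoding" pattern it references, here is a minimal sketch in the spirit of k-means dictionary learning with soft-threshold encoding, applied to concatenated RGB-D patch vectors. All function names, patch sizes, and parameters below are illustrative assumptions, not the authors' actual method.

```python
import numpy as np

rng = np.random.default_rng(0)

def learn_dictionary(patches, k=16, iters=10):
    """Illustrative k-means dictionary learning on flattened patch vectors.

    patches: (N, D) array; returns (k, D) array of learned centroids.
    """
    centers = patches[rng.choice(len(patches), k, replace=False)].copy()
    for _ in range(iters):
        # assign each patch to its nearest centroid (squared Euclidean distance)
        dist = ((patches[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        labels = dist.argmin(1)
        for j in range(k):
            members = patches[labels == j]
            if len(members):  # leave empty clusters unchanged
                centers[j] = members.mean(0)
    return centers

def soft_encode(patches, centers):
    """Soft-threshold encoding: f_j = max(0, mu - dist_j),
    where mu is the mean distance from the patch to all centroids."""
    dist = np.sqrt(((patches[:, None, :] - centers[None, :, :]) ** 2).sum(-1))
    mu = dist.mean(1, keepdims=True)
    return np.maximum(0.0, mu - dist)

# Toy multi-modal input: flattened 5x5 RGB patches (75 dims) concatenated
# with flattened 5x5 depth patches (25 dims), standing in for RGB-D data.
rgb = rng.standard_normal((200, 75))
depth = rng.standard_normal((200, 25))
patches = np.concatenate([rgb, depth], axis=1)

D = learn_dictionary(patches, k=16)
F = soft_encode(patches, D)
print(F.shape)  # (200, 16): one non-negative code vector per patch
```

Stacking, as mentioned in the abstract, would repeat this learn-and-encode step on the codes produced by the previous layer to derive higher-level features.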

