Canonical Correlation Analysis Regularization: An Effective Deep Multiview Learning Baseline for RGB-D Object Recognition

Tang Lulu; Yang Zhi-Xin; Jia Kui

首页> 外文期刊>IEEE Transactions on Cognitive and Developmental Systems >Canonical Correlation Analysis Regularization: An Effective Deep Multiview Learning Baseline for RGB-D Object Recognition

【24h】

Canonical Correlation Analysis Regularization: An Effective Deep Multiview Learning Baseline for RGB-D Object Recognition

机译：典型相关分析正则化：RGB-D对象识别的有效深度多视图学习基准

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Object recognition methods based on multimodal data, color plus depth (RGB-D), usually treat each modality separately in feature extraction, which neglects implicit relations between two views and preserves noise from any view to the final representation. To address these limitations, we propose a novel canonical correlation analysis (CCA)-based multiview convolutional neural network (CNNs) framework for RGB-D object representation. The RGB and depth streams process corresponding images, respectively, then are connected by CCA module leading to a common-correlated feature space. In addition, to embed CCA into deep CNNs in a supervised manner, two different schemes are explored. One considers CCA as a regularization (CCAR) term adding to the loss function. However, solving CCA optimization directly is neither computationally efficient nor compatible with the mini-batch-based stochastic optimization. Thus, we further propose an approximation method of CCAR, using the obtained CCA projection matrices to replace the weights of feature concatenation layer at regular intervals. Such a scheme enjoys benefits of full CCAR and is efficient by amortizing its cost over many training iterations. Experiments on benchmark RGB-D object recognition datasets have shown that the proposed methods outperform most existing methods using the very same of their network architectures.

机译：基于多模式数据，颜色加深度（RGB-D）的对象识别方法通常在特征提取中分别对待每种模式，这会忽略两个视图之间的隐式关系，并保留从任何视图到最终表示的噪声。为了解决这些限制，我们提出了一种新颖的基于典范相关分析（CCA）的RGB-D对象表示的多视图卷积神经网络（CNN）框架。 RGB流和深度流分别处理对应的图像，然后通过CCA模块连接到通向相关的特征空间。另外，为了以有监督的方式将CCA嵌入到深层CNN中，探索了两种不同的方案。人们认为CCA是增加损失函数的正则化（CCAR）项。但是，直接解决CCA优化既无计算效率，也与基于小批量的随机优化不兼容。因此，我们进一步提出了一种CCAR近似方法，使用获得的CCA投影矩阵以规则的间隔替换特征级联层的权重。这样的方案享有完整的CCAR的好处，并且通过在许多培训迭代中摊销其成本而非常有效。在基准RGB-D对象识别数据集上进行的实验表明，所提出的方法使用与它们相同的网络体系结构，优于大多数现有方法。

著录项

来源
《IEEE Transactions on Cognitive and Developmental Systems》 |2019年第1期|107-118|共12页
作者
Tang Lulu; Yang Zhi-Xin; Jia Kui;
展开▼
作者单位

Univ Macau, Dept Electromech Engn, Fac Sci & Technol, Macau 999078, Peoples R China;

Univ Macau, Dept Electromech Engn, Fac Sci & Technol, Macau 999078, Peoples R China;

South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510641, Guangdong, Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Canonical correlation analysis (CCA); deep learning; multiview feature learning; RGB-D object recognition;

机译：规范相关分析（CCA）;深度学习;多视图特色学习;RGB-D对象识别;
入库时间 2022-08-18 04:30:24

相似文献

外文文献
中文文献
专利

1. Canonical Correlation Analysis Regularization: An Effective Deep Multiview Learning Baseline for RGB-D Object Recognition [J] . Tang Lulu, Yang Zhi-Xin, Jia Kui IEEE Transactions on Cognitive and Developmental Systems . 2019,第1期

机译：规范相关分析正规化：RGB-D对象识别的有效深度多视图学习基线
2. Composite nonlinearmultiset canonical correlation analysis for multiview feature learning and recognition [J] . Yuan Yun-Hao, Shen Xiaobo, Li Yun, Concurrency and computation: practice and experience . 2021,第15期

机译：多视图特色学习与识别的复合非线性多种规范相关分析
3. Uniform and Variational Deep Learning for RGB-D Object Recognition and Person Re-Identification [J] . Ren Liangliang, Lu Jiwen, Feng Jianjiang, IEEE Transactions on Image Processing . 2019,第10期

机译：用于RGB-D对象识别和人员重新识别的统一变式深度学习
4. Supervised Deep Canonical Correlation Analysis for Multiview Feature Learning [C] . Yan Liu, Yun Li, Yun-Hao Yuan, International conference on neural information processing . 2017

机译：有监督的深度典范相关分析，用于多视图特征学习
5. Attention and Depth Hallucination for RGB-D Face Recognition with Deep Learning [D] . Uppal, Hardik. 2021

机译：深度学习的RGB-D人脸识别的关注和深度幻觉
6. An Effective 3D Shape Descriptor for Object Recognition with RGB-D Sensors [O] . Zhong Liu, Changchen Zhao, Xingming Wu, 2017

机译：用于RGB-D传感器的目标识别的有效3D形状描述符
7. Improving a Deep Learning based RGB-D Object Recognition Model by Ensemble Learning [O] . Aakerberg, Andreas, Nasrollahi, Kamal, Heder, Thomas 2017

机译：利用集成学习改进基于深度学习的RGB-D目标识别模型

Canonical Correlation Analysis Regularization: An Effective Deep Multiview Learning Baseline for RGB-D Object Recognition

摘要

著录项

相似文献

相关主题

期刊订阅