Journal of Electronic Imaging

Two-stream Siamese network with contrastive-center losses for RGB-D action recognition

Abstract

Many fusion methods have been developed to improve the performance of action recognition with RGB and depth data, yet learning a conjoint representation of heterogeneous modalities with a single network has received little attention. We present an associated representation method for RGB-D action recognition using a Siamese network with contrastive-center losses. First, samples of each class in each modality are selected as references to construct positive and negative pairs: a positive pair consists of a training sample and a reference of its own class, whereas a negative pair consists of a training sample and a reference of a different class. These pairs are then fed into a two-stream Siamese network to learn a collaborative representation of RGB and depth data. Two ranking losses, the intramodal and cross-modal contrastive-center losses, are developed to impose a similarity/dissimilarity metric on these pairs. Specifically, the intramodal contrastive-center loss measures the relationship between samples and references within the RGB or depth modality, while the cross-modal contrastive-center loss measures the relationship between visual and depth features in the same low-dimensional space. Finally, the ranking losses and a softmax loss are jointly optimized for action recognition. The proposed method is evaluated on two large action datasets, LAP IsoGD and NTU RGB+D, and a smaller dataset, Sheffield Kinect Gesture. The experimental results demonstrate that the proposed method surpasses most state-of-the-art methods. (C) 2019 SPIE and IS&T
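The abstract does not give the exact loss formulations, so the PyTorch sketch below illustrates one plausible reading: a margin-based contrastive-center loss applied within each modality (sample embeddings vs. same-modality class references) and across modalities (RGB embeddings vs. depth references and vice versa), jointly optimized with a softmax cross-entropy loss. All names here (`ContrastiveCenterLoss`, `TwoStreamSiamese`, the weights `lam_intra`/`lam_cross`, the margin value) are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ContrastiveCenterLoss(nn.Module):
    """Hypothetical margin-based contrastive-center loss.

    Pulls an embedding toward the reference of its own class (positive
    pair) and pushes it at least `margin` away from references of other
    classes (negative pairs). The paper's exact formulation may differ.
    """
    def __init__(self, margin: float = 1.0):
        super().__init__()
        self.margin = margin

    def forward(self, emb, pos_ref, neg_refs):
        # emb:      (B, D) sample embeddings
        # pos_ref:  (B, D) same-class reference for each sample
        # neg_refs: (B, K, D) references from K other classes
        pos_dist = (emb - pos_ref).pow(2).sum(dim=1)                # (B,)
        neg_dist = (emb.unsqueeze(1) - neg_refs).pow(2).sum(dim=2)  # (B, K)
        # hinge term: penalize negatives that come closer than the margin
        push = F.relu(self.margin - neg_dist).mean(dim=1)           # (B,)
        return (pos_dist + push).mean()

class TwoStreamSiamese(nn.Module):
    """Two-stream network: one backbone per modality, with a shared
    projection so RGB and depth features land in the same
    low-dimensional embedding space."""
    def __init__(self, rgb_backbone, depth_backbone,
                 feat_dim: int, embed_dim: int, num_classes: int):
        super().__init__()
        self.rgb_stream = rgb_backbone      # e.g., a video CNN -> (B, feat_dim)
        self.depth_stream = depth_backbone  # same output shape assumed
        self.proj = nn.Linear(feat_dim, embed_dim)  # shared embedding space
        self.classifier = nn.Linear(2 * embed_dim, num_classes)

    def forward(self, rgb_clip, depth_clip):
        f_rgb = self.proj(self.rgb_stream(rgb_clip))      # (B, E)
        f_dep = self.proj(self.depth_stream(depth_clip))  # (B, E)
        logits = self.classifier(torch.cat([f_rgb, f_dep], dim=1))
        return f_rgb, f_dep, logits

# Joint objective: softmax loss plus intramodal and cross-modal ranking
# losses. The loss weights are placeholders, not values from the paper.
ranking = ContrastiveCenterLoss(margin=1.0)
softmax_loss = nn.CrossEntropyLoss()
lam_intra, lam_cross = 0.1, 0.1

def total_loss(f_rgb, f_dep, logits, labels,
               rgb_pos, rgb_negs, dep_pos, dep_negs):
    # intramodal: sample vs. references of the same modality
    intra = (ranking(f_rgb, rgb_pos, rgb_negs)
             + ranking(f_dep, dep_pos, dep_negs))
    # cross-modal: sample vs. references of the other modality
    cross = (ranking(f_rgb, dep_pos, dep_negs)
             + ranking(f_dep, rgb_pos, rgb_negs))
    return softmax_loss(logits, labels) + lam_intra * intra + lam_cross * cross
```

In this reading, the reference embeddings would be obtained by passing the selected per-class reference samples through the same streams; sharing the projection layer is one simple way to realize the common low-dimensional space in which the cross-modal loss compares visual and depth features.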
