Semi-Supervised Cross-Modality Action Recognition by Latent Tensor Transfer Learning

Jia Chengcheng; Ding Zhengming; Kong Yu; Fu Yun

首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >Semi-Supervised Cross-Modality Action Recognition by Latent Tensor Transfer Learning

【24h】

Semi-Supervised Cross-Modality Action Recognition by Latent Tensor Transfer Learning

机译：半监控跨越式动作识别潜伏传输学习

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Microsoft's Kinect sensors are receiving an increasing amount of interests by security researchers since they are cost-effective and can provide both visual and depth modality data at the same time. Unfortunately, depth or RGB modalities are unavailable in training or testing procedures in some realistic scenarios. Therefore, we explore a new problem focusing on the arbitrary absence of modality, which is completely different from the conventional action recognition. The new problem in action recognition aims to deal with cross modality data (e.g., RGB training and depth testing data), "missing" modality data (e.g., RGB training and RGB-D test data), and single-modality data (e.g., RGB/depth in both phases). Accordingly, our method aims to borrow some information (e.g., correlation between two modalities) from the well-established RGB-D dataset and apply it to the existing dataset to recover some latent information to improve the performance of recognition. For instance, a cross-modality regularizer is used to preserve the correlation of RGB and depth modalities. The "missing" knowledge is considered as latent information, which is recovered by low-rank learning in our model. In the real world, the target data are usually sparsely labeled or completely unlabeled; however, we could exploit the pseudolabels of the target as prior knowledge for "supervised" learning in the target domain. Accordingly, we propose a semi-supervised model for transfer learning. The experiments on three widely used RGB-D action datasets show that our method performs better than that of the state-of-the-art transfer learning methods in most cases in terms of accuracy and time efficiency.

机译：Microsoft的Kinect传感器正在通过安全研究人员获得越来越多的兴趣，因为它们具有成本效益，并且可以同时提供视觉和深度模态数据。不幸的是，在一些现实情景中的培训或测试程序中，深度或RGB模式不可用。因此，我们探索关注的新问题，这些问题涉及任意缺乏模态，这与传统的动作识别完全不同。行动识别中的新问题旨在处理跨模型数据（例如，RGB培训和深度测试数据），“缺少”模态数据（例如，RGB训练和RGB-D测试数据）和单模数据（例如，两个阶段的RGB /深度）。因此，我们的方法旨在从已建立的RGB-D数据集中借一些信息（例如，两个模式之间的相关性），并将其应用于现有数据集以恢复一些潜在信息以提高识别性能。例如，跨模型规范器用于保留RGB和深度模态的相关性。 “缺失”知识被视为潜在信息，这些信息被我们模型中的低级学习恢复。在现实世界中，目标数据通常稀疏标记或完全未标记;但是，我们可以利用目标的伪标签作为目标领域中“监督”学习的先验知识。因此，我们提出了一个半监督用于转移学习模型。三种广泛使用的RGB-D动作数据集的实验表明，在大多数情况下，我们的方法在大多数情况下比准确性和时间效率的案例更好地表现出更好的转移学习方法。

著录项

来源
《IEEE Transactions on Circuits and Systems for Video Technology》 |2020年第9期|2801-2814|共14页
作者
Jia Chengcheng; Ding Zhengming; Kong Yu; Fu Yun;
展开▼
作者单位

Northeastern Univ Dept Elect & Comp Engn Boston MA 02115 USA|Futurewei Technol Santa Clara CA 95050 USA;

Indiana Univ Purdue Univ Dept Comp Informat & Technol Indianapolis IN 46202 USA;

Rochester Inst Technol B Thomas Golisano Coll Comp & Informat Sci Rochester NY 14623 USA;

Northeastern Univ Dept Elect & Comp Engn Coll Engn Boston MA 02115 USA|Northeastern Univ Khoury Coll Comp & Informat Sci Boston MA 02115 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Correlation; Training; Feature extraction; Target recognition; Tensors; Testing; Semantics; RGB-D action; cross-modality; missing modality; latent information; semi-supervised; transfer learning; low-rank tensor;

机译：相关;特征提取;目标识别;张量;测试;语义;RGB-D行动;跨偶数;缺少的方式;潜在信息;半监督;转移学习;低级张量;

相似文献

外文文献
中文文献
专利

1. A Novel Angle-Based Learning Framework on Semi-supervised Dimensionality Reduction in High-Dimensional Data with Application to Action Recognition [J] . Zahra Ramezani, Ahmad Pourdarvish, Kiumars Teymourian Arabian Journal for Science and Engineering. Section A, Sciences . 2020,第12期

机译：基于角度的学习框架，用于动作识别的高维数据的半监督维度减少
2. 3D Features for human action recognition with semi-supervised learning [J] . Sahoo Suraj Prakash, Srinivasu Ulli, Ari Samit Image Processing, IET . 2019,第6期

机译：具有半监督学习功能的3D人体动作识别功能
3. Evaluation of semi-supervised learning method on action recognition [J] . Haoquan Shen, Yan Yan, Shicheng Xu, Multimedia Tools and Applications . 2015,第2期

机译：半监督学习方法对动作识别的评价
4. Learning to Transfer: Transferring Latent Task Structures and Its Application to Person-Specific Facial Action Unit Detection [C] . Timur Almaev, Brais Martinez, Michel Valstar IEEE International Conference on Computer Vision . 2015

机译：学习转移：转移潜在任务结构及其在特定于人的面部动作单元检测中的应用
5. Low-Rank Tensor Learning for Human Action Recognition. [D] . Jia, Chengcheng. 2016

机译：用于人类动作识别的低阶张量学习。
6. Meta-Transfer Learning Driven Tensor-Shot Detector for the Autonomous Localization and Recognition of Concealed Baggage Threats [O] . Taimur Hassan, Muhammad Shafay, Samet Akçay, 2020

机译：元转移学习驱动的张力射击探测器用于自主定位和识别隐藏行李威胁
7. Semi-Supervised Deep Transfer Learning-Based on Adversarial Feature Learning for Label Limited SAR Target Recognition [O] . Wei Zhang, Yongfeng Zhu, Qiang Fu 2019

机译：基于对抗性特征学习的半监督深度转移学习限制SAR目标识别

Semi-Supervised Cross-Modality Action Recognition by Latent Tensor Transfer Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅