...
首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >Semi-Supervised Cross-Modality Action Recognition by Latent Tensor Transfer Learning
【24h】

Semi-Supervised Cross-Modality Action Recognition by Latent Tensor Transfer Learning

机译:半监控跨越式动作识别潜伏传输学习

获取原文
获取原文并翻译 | 示例
           

摘要

Microsoft's Kinect sensors are receiving an increasing amount of interests by security researchers since they are cost-effective and can provide both visual and depth modality data at the same time. Unfortunately, depth or RGB modalities are unavailable in training or testing procedures in some realistic scenarios. Therefore, we explore a new problem focusing on the arbitrary absence of modality, which is completely different from the conventional action recognition. The new problem in action recognition aims to deal with cross modality data (e.g., RGB training and depth testing data), "missing" modality data (e.g., RGB training and RGB-D test data), and single-modality data (e.g., RGB/depth in both phases). Accordingly, our method aims to borrow some information (e.g., correlation between two modalities) from the well-established RGB-D dataset and apply it to the existing dataset to recover some latent information to improve the performance of recognition. For instance, a cross-modality regularizer is used to preserve the correlation of RGB and depth modalities. The "missing" knowledge is considered as latent information, which is recovered by low-rank learning in our model. In the real world, the target data are usually sparsely labeled or completely unlabeled; however, we could exploit the pseudolabels of the target as prior knowledge for "supervised" learning in the target domain. Accordingly, we propose a semi-supervised model for transfer learning. The experiments on three widely used RGB-D action datasets show that our method performs better than that of the state-of-the-art transfer learning methods in most cases in terms of accuracy and time efficiency.
机译:Microsoft的Kinect传感器正在通过安全研究人员获得越来越多的兴趣,因为它们具有成本效益,并且可以同时提供视觉和深度模态数据。不幸的是,在一些现实情景中的培训或测试程序中,深度或RGB模式不可用。因此,我们探索关注的新问题,这些问题涉及任意缺乏模态,这与传统的动作识别完全不同。行动识别中的新问题旨在处理跨模型数据(例如,RGB培训和深度测试数据),“缺少”模态数据(例如,RGB训练和RGB-D测试数据)和单模数据(例如,两个阶段的RGB /深度)。因此,我们的方法旨在从已建立的RGB-D数据集中借一些信息(例如,两个模式之间的相关性),并将其应用于现有数据集以恢复一些潜在信息以提高识别性能。例如,跨模型规范器用于保留RGB和深度模态的相关性。 “缺失”知识被视为潜在信息,这些信息被我们模型中的低级学习恢复。在现实世界中,目标数据通常稀疏标记或完全未标记;但是,我们可以利用目标的伪标签作为目标领域中“监督”学习的先验知识。因此,我们提出了一个半监督用于转移学习模型。三种广泛使用的RGB-D动作数据集的实验表明,在大多数情况下,我们的方法在大多数情况下比准确性和时间效率的案例更好地表现出更好的转移学习方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号