首页> 外文期刊>The Visual Computer >Auxiliary criterion conversion via spatiotemporal semantic encoding and feature entropy for action recognition
【24h】

Auxiliary criterion conversion via spatiotemporal semantic encoding and feature entropy for action recognition

机译:辅助标准转换通过时空语义编码和特征熵进行动作识别

获取原文
获取原文并翻译 | 示例
           

摘要

Video-based action recognition in realistic scenes is a core technology for human-computer interaction and smart surveillance. Although the trajectory features with the bag of visual words have confirmed promising performance, spatiotemporal interactive information cannot be effectively encoded which is valuable for classification. To address this issue, we propose a spatiotemporal semantic feature (ST-SF) and implement the conversion of it to the auxiliary criterion based on the information entropy theory. First, we present a text-based relevance analysis method to estimate the textual labels of objects most relevant to actions, which are employed to train the more targeted detectors based on the deep network. False detections are optimized by the inter-frame cooperativity and dynamic programming to construct the valid tubes. Then, we design the ST-SF to encode the interactive information, and the concept and calculation of feature entropy are defined based on the spatial distribution of ST-SFs on the training set. Finally, we achieve a two-stage classification strategy using the resulting decision gains. Experimental results on three publicly available datasets demonstrate that our method is robust and improves upon the state-of-the-art algorithms.
机译:现实场景中基于视频的动作识别是人机互动和智能监控的核心技术。虽然具有袋子的轨迹特征已经确认了有希望的性能,但不能有效地编码时空交互式信息,这对分类有价值。为了解决这个问题,我们提出了一种时空语义特征(ST-SF)并基于信息熵理论将其转换为辅助标准。首先,我们提出了一种基于文本的相关性分析方法来估计与动作最相关的对象的文本标签,这些对象是基于深网络训练更具针对性的检测器的对象。通过帧间协作和动态编程来优化假检测以构造有效管。然后,我们设计ST-SF来编码交互信息,并且特征熵的概念和计算是根据训练集上的ST-SFS的空间分布来定义的。最后,我们使用由此产生的决策增益实现了两阶段的分类策略。在三个公共数据集上的实验结果表明,我们的方法是强大的,并改善了最先进的算法。

著录项

  • 来源
    《The Visual Computer》 |2021年第7期|1673-1690|共18页
  • 作者单位

    Beijing Univ Technol Fac Informat Technol Beijing 100124 Peoples R China|Beijing Key Lab Computat Intelligence & Intellige Beijing 100124 Peoples R China|Minist Educ Engn Res Ctr Digital Community Beijing 100124 Peoples R China;

    Beijing Univ Technol Fac Informat Technol Beijing 100124 Peoples R China|Beijing Key Lab Computat Intelligence & Intellige Beijing 100124 Peoples R China|Minist Educ Engn Res Ctr Digital Community Beijing 100124 Peoples R China;

    Beijing Univ Technol Fac Informat Technol Beijing 100124 Peoples R China|Beijing Key Lab Computat Intelligence & Intellige Beijing 100124 Peoples R China|Minist Educ Engn Res Ctr Digital Community Beijing 100124 Peoples R China;

    Beijing Univ Technol Fac Informat Technol Beijing 100124 Peoples R China|Beijing Key Lab Computat Intelligence & Intellige Beijing 100124 Peoples R China|Minist Educ Engn Res Ctr Digital Community Beijing 100124 Peoples R China;

    Beijing Univ Technol Fac Informat Technol Beijing 100124 Peoples R China|Beijing Key Lab Computat Intelligence & Intellige Beijing 100124 Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Action recognition; Spatiotemporal semantic feature; Feature entropy; Bag-of-visual-words model; Text-based relevance analysis;

    机译:行动识别;时空语义特征;特征熵;袋 - 视觉单词模型;基于文本的相关性分析;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号