首页> 外文期刊>International Journal of Computer Vision >Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words
【24h】

Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words

机译:使用时空词对人类行为类别进行无监督学习

获取原文
获取原文并翻译 | 示例
           

摘要

We present a novel unsupervised learning method for human action categories. A video sequence is represented as a collection of spatial-temporal words by extracting space-time interest points. The algorithm automatically learns the probability distributions of the spatial-temporal words and the intermediate topics corresponding to human action categories. This is achieved by using latent topic models such as the probabilistic Latent Semantic Analysis (pLSA) model and Latent Dirichlet Allocation (LDA). Our approach can handle noisy feature points arisen from dynamic background and moving cameras due to the application of the probabilistic models. Given a novel video sequence, the algorithm can categorize and localize the human action(s) contained in the video. We test our algorithm on three challenging datasets: the KTH human motion dataset, the Weizmann human action dataset, and a recent dataset of figure skating actions. Our results reflect the promise of such a simple approach. In addition, our algorithm can recognize and localize multiple actions in long and complex video sequences containing multiple motions.
机译:我们为人类行为类别提出了一种新颖的无监督学习方法。通过提取时空兴趣点,将视频序列表示为时空单词的集合。该算法自动学习与人类行为类别相对应的时空单词和中间主题的概率分布。这可以通过使用潜在主题模型(例如,概率潜在语义分析(pLSA)模型和潜在狄利克雷分配(LDA))来实现。由于概率模型的应用,我们的方法可以处理由于动态背景和运动相机而产生的嘈杂特征点。给定一个新颖的视频序列,该算法可以对视频中包含的人类行为进行分类和定位。我们在三个具有挑战性的数据集上测试了我们的算法:KTH人体运动数据集,Weizmann人体动作数据集和最近的花样滑冰动作数据集。我们的结果反映了这种简单方法的前景。此外,我们的算法可以识别并定位包含多个动作的长而复杂的视频序列中的多个动作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号