首页> 外文会议>IEEE International Conference on Multimedia and Expo Workshops >Multi-modal GM-plsa and its application to video classification
【24h】

Multi-modal GM-plsa and its application to video classification

机译:多模式GM-plsa及其在视频分类中的应用

获取原文

摘要

To extend standard probabilistic Latent Semantic Analysis (pLSA) to handle continuous quantity, pLSA with Gaussian Mixtures (GM-pLSA) has been proposed, which models the continuous features of terms via a Gaussian Mixture Model (GMM). Stemming from GM-pLSA, this paper presents a multi-modal GM-pLSA (MMGM-pLSA) model to deal with the situation where continuous features from multiple modalities are extracted from one term. Based on our assumption that the multi-modal features of one term independently come from the same latent aspect, multiple GMMs are introduced with each of them depicting the feature distribution of each modality. By doing so, the characteristic of each modality is captured and embodied. To evaluate the performance, a prototype of typical video classification is devised, in which each video clip is interpreted as one document and its sub-shots as terms. Experimental comparisons with other approaches demonstrate the effectiveness of MMGM-pLSA.
机译:为了扩展标准概率潜在语义分析(pLSA)以处理连续量,已提出了具有高斯混合(GM-pLSA)的pLSA,该模型通过高斯混合模型(GMM)对项的连续特征进行建模。本文以GM-pLSA为基础,提出了一种多模式GM-pLSA(MMGM-pLSA)模型,以处理从一个术语中提取多种模式的连续特征的情况。基于我们的假设,即一个术语的多模态特征独立地来自相同的潜在方面,因此引入了多个GMM,每个GMM都描述了每种模态的特征分布。通过这样做,捕获并体现了每个模态的特征。为了评估性能,设计了一个典型的视频分类原型,其中每个视频剪辑都被解释为一个文档,而其子快照则被解释为术语。与其他方法的实验比较证明了MMGM-pLSA的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号