首页> 外文会议>IEEE International Conference on Multimedia and Expo Workshops >MULTI-MODAL GM-PLSA AND ITS APPLICATION TO VIDEO CLASSIFICATION
【24h】

MULTI-MODAL GM-PLSA AND ITS APPLICATION TO VIDEO CLASSIFICATION

机译:多模态GM-PLSA及其在视频分类中的应用

获取原文

摘要

To extend standard probabilistic Latent Semantic Analysis (pLSA) to handle continuous quantity, pLSA with Gaussian Mixtures (GM-pLSA) has been proposed, which models the continuous features of terms via a Gaussian Mixture Model (GMM). Stemming from GM-pLSA, this paper presents a multi-modal GM-pLSA (MMGM-pLSA) model to deal with the situation where continuous features from multiple modalities are extracted from one term. Based on our assumption that the multi-modal features of one term independently come from the same latent aspect, multiple GMMs are introduced with each of them depicting the feature distribution of each modality. By doing so, the characteristic of each modality is captured and embodied. To evaluate the performance, a prototype of typical video classification is devised, in which each video clip is interpreted as one document and its sub-shots as terms. Experimental comparisons with other approaches demonstrate the effectiveness of MMGM-pLSA.
机译:为了扩展标准概率潜伏语义分析(PLSA)以处理连续数量,已经提出了具有高斯混合物(GM-PLSA)的PLSA,其通过高斯混合模型(GMM)模拟了术语的连续特征。来自GM-PLSA的源,本文介绍了一个多模态GM-PLSA(MMGM-PLSA)模型,以处理从一个术语中提取多种方式的连续特征的情况。基于我们假设一个术语独立地来自同一潜在观点的多模态特征,用它们中的每一个引入多个GMM,描绘了每个模态的特征分布。通过这样做,捕获并体现了每个模态的特征。为了评估性能,设计了典型视频分类的原型,其中每个视频剪辑被解释为一个文档及其子拍摄。与其他方法的实验比较证明了MMGM-PLSA的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号