首页> 外文期刊>Multimedia, IEEE Transactions on >Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept Detection
【24h】

Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept Detection

机译:视频概念检测中多模式融合和概念融合的高效启发式方法

获取原文
获取原文并翻译 | 示例
           

摘要

Semantic models are widely used to bridge the semantic gap between low-level features and high-level features in video concept indexing. Multimodal fusion and concept fusion are two commonly used approaches in building semantic models. In the previous work, domain adaptation is neglected in multimodal fusion, and many probability maximization based and unsupervised concept fusion methods are counterintuitive since they do not incorporate subjective human intuition. In this paper, we present a new two-stage semantic model combining the multimodal fusion and the concept fusion incorporating human heuristics. In the multimodal fusion model, we employ a new generic unsupervised method, namely, domain adaptive linear combination (DALC), to update the linear combination (LC) weights by incorporating the differences of element distributions between training and testing domains. In the concept fusion model, a novel mechanical node equilibrium (NE) model is developed by using forces to model the concept correlations to update the score of concepts represented by nodes. It is intuitive and can incorporate multiple kinds of correlations simultaneously to construct more sophisticated semantic structure. Compared to other state-of-the-art supervised and unsupervised methods, the new model can use either unsupervised or supervised factors to significantly improve the mean inferred average precision (MAP) performance on all datasets.
机译:语义模型被广泛用于弥合视频概念索引中低级特征和高级特征之间的语义鸿沟。多模式融合和概念融合是构建语义模型的两种常用方法。在先前的工作中,域自适应在多模式融合中被忽略了,许多基于概率最大化和无监督概念融合的方法是违反直觉的,因为它们没有包含人的主观直觉。在本文中,我们提出了一个新的两阶段语义模型,该模型将多模式融合和结合了人类启发式的概念融合相结合。在多模式融合模型中,我们采用了一种新的通用无监督方法,即域自适应线性组合(DALC),通过合并训练域和测试域之间元素分布的差异来更新线性组合(LC)权重。在概念融合模型中,通过使用力对概念相关性进行建模以更新由节点表示的概念的分数,开发了一种新颖的机械节点平衡(NE)模型。它是直观的,可以同时合并多种关联以构造更复杂的语义结构。与其他最新的有监督和无监督方法相比,新模型可以使用无监督或受监督因素来显着提高所有数据集的平均推断平均精度(MAP)性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号