Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept Detection

Geng Jie; Miao Zhenjiang; Zhang Xiao-Ping

首页> 外文期刊>Multimedia, IEEE Transactions on >Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept Detection

【24h】

Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept Detection

机译：视频概念检测中多模式融合和概念融合的高效启发式方法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Semantic models are widely used to bridge the semantic gap between low-level features and high-level features in video concept indexing. Multimodal fusion and concept fusion are two commonly used approaches in building semantic models. In the previous work, domain adaptation is neglected in multimodal fusion, and many probability maximization based and unsupervised concept fusion methods are counterintuitive since they do not incorporate subjective human intuition. In this paper, we present a new two-stage semantic model combining the multimodal fusion and the concept fusion incorporating human heuristics. In the multimodal fusion model, we employ a new generic unsupervised method, namely, domain adaptive linear combination (DALC), to update the linear combination (LC) weights by incorporating the differences of element distributions between training and testing domains. In the concept fusion model, a novel mechanical node equilibrium (NE) model is developed by using forces to model the concept correlations to update the score of concepts represented by nodes. It is intuitive and can incorporate multiple kinds of correlations simultaneously to construct more sophisticated semantic structure. Compared to other state-of-the-art supervised and unsupervised methods, the new model can use either unsupervised or supervised factors to significantly improve the mean inferred average precision (MAP) performance on all datasets.

机译：语义模型被广泛用于弥合视频概念索引中低级特征和高级特征之间的语义鸿沟。多模式融合和概念融合是构建语义模型的两种常用方法。在先前的工作中，域自适应在多模式融合中被忽略了，许多基于概率最大化和无监督概念融合的方法是违反直觉的，因为它们没有包含人的主观直觉。在本文中，我们提出了一个新的两阶段语义模型，该模型将多模式融合和结合了人类启发式的概念融合相结合。在多模式融合模型中，我们采用了一种新的通用无监督方法，即域自适应线性组合（DALC），通过合并训练域和测试域之间元素分布的差异来更新线性组合（LC）权重。在概念融合模型中，通过使用力对概念相关性进行建模以更新由节点表示的概念的分数，开发了一种新颖的机械节点平衡（NE）模型。它是直观的，可以同时合并多种关联以构造更复杂的语义结构。与其他最新的有监督和无监督方法相比，新模型可以使用无监督或受监督因素来显着提高所有数据集的平均推断平均精度（MAP）性能。

著录项

来源
《Multimedia, IEEE Transactions on》 |2015年第4期|498-511|共14页
作者
Geng Jie; Miao Zhenjiang; Zhang Xiao-Ping;
展开▼
作者单位

Institute of Information Science, Beijing Jiaotong University, Beijing, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Adaptation models; Correlation; Detectors; Histograms; Indexing; Semantics; Vectors; Concept fusion; domain adaption; multimodal fusion; video concept indexing;

机译：适应模型;相关性;检测器;直方图;索引;语义;向量;概念融合;域自适应;多峰融合;视频概念索引;

相似文献

外文文献
中文文献
专利

1. Multi-label semantic concept detection in videos using fusion of asymmetrically trained deep convolutional neural networks and foreground driven concept co-occurrence matrix [J] . Janwe Nitin J., Bhoyar Kishor K. Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2018,第8期

机译：使用非对称训练的深卷积神经网络和前景驱动概念共发生矩阵的视频中的多标签语义概念检测
2. Online Reranking via Ordinal Informative Concepts for Context Fusion in Concept Detection and Video Search [J] . Yang Y.-H., Hsu W. H., Chen H. H. Circuits and Systems for Video Technology, IEEE Transactions on . 2009,第12期

机译：在概念检测和视频搜索中通过有序信息概念进行在线重排以进行上下文融合
3. Multimodal recognition of visual concepts using histograms of textual concepts and selective weighted late fusion scheme [J] . Ningning Liu, Emmanuel Dellandrea, Liming Chen, Computer vision and image understanding . 2013,第5期

机译：使用文本概念直方图和选择性加权后期融合方案对视觉概念进行多模式识别
4. Video Semantic Concept Detection Based on MultiModality Fusion [C] . Zhao Jianxun, Wu BO International Conference on Computer Science and Electronics Engineering . 2012

机译：基于多模态融合的视频语义概念检测
5. The Johari Wonderland: The Fusion of Classic Literature and Film to Enhance Key Group Counseling Concepts and Counselor Reflexivity [D] . Parker, Jan L. 2018

机译：Johari仙境：经典文学与电影的融合，以增强关键群体的咨询概念和咨询顾问的反思能力
6. A Real-Time Fire Detection Method from Video with Multifeature Fusion [O] . Faming Gong, Chuantao Li, Wenjuan Gong, 2019

机译：具有多特征融合的视频实时火灾检测方法
7. Online reranking via ordinal informative concepts for context fusion in concept detection and video search [O] . Yi-hsuan Yang, Winston H. Hsu, Homer H. Chen 2009

机译：通过有序信息概念进行在线重排，以在概念检测和视频搜索中进行上下文融合
8. Zero shot Event Detection using Multi modal Fusion of Weakly Supervised Concepts (Open Access). [R] . Wu, S., Bondugula, S., Luisier, F., 2014

机译：使用弱模式融合的弱监督概念（开放存取）进行零射击事件检测。

Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept Detection

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅