Combining multi-representation for multimedia event detection using co-training

Bin Yi; Yang Yang; Shen Fumin; Xu Xing

首页> 外文期刊>Neurocomputing >Combining multi-representation for multimedia event detection using co-training

【24h】

Combining multi-representation for multimedia event detection using co-training

机译：使用联合训练将多表示结合用于多媒体事件检测

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years, multimedia event detection has been attracting extensive research attention because of the exponential increase in volume of web video data. Traditional approaches usually utilize single visual representation, which may suffer from the problem of insufficient descriptive power. How to jointly employ multiple types of visual representation to facilitate multimedia event detection (MED) in videos remains an open problem. In this work, we propose a novel system for event detection based on combination of multi-view representations and co-training algorithm. Specifically, given several types of low-level visual features (i.e., Convolutional Neural Networks (CNNs) and Fisher vector), we first train an initial classifier for each type of visual feature. Then, we use these classifiers to separately predict labels of unlabeled videos, and those with consistent prediction are merged into the training set. We alternatively repeat the processes of training the classifiers and enlarging the training set until convergence, To investigate the relationship among different types of visual features, the prediction scores of the two classifiers are fused by a linear weighted fusion method. We evaluate our MED system on the TRECVID MED11 data set, and the experimental results have demonstrated the outstanding performance of the proposed approach as compared to several other state-of-the-art algorithms. (C) 2016 Elsevier B.V. All rights reserved.

机译：近年来，由于网络视频数据量呈指数级增长，多媒体事件检测已引起广泛的研究关注。传统方法通常使用单一的视觉表示，这可能会遭受描述能力不足的问题。如何联合使用多种类型的视觉表示来促进视频中的多媒体事件检测（MED）仍然是一个悬而未决的问题。在这项工作中，我们提出了一种基于多视图表示和协同训练算法的事件检测新系统。具体来说，给定几种类型的低级视觉特征（即卷积神经网络（CNN）和Fisher向量），我们首先为每种视觉特征训练一个初始分类器。然后，我们使用这些分类器分别预测未标记视频的标签，并将具有一致预测的视频合并到训练集中。我们也可以重复训练分类器并扩大训练集直到收敛的过程。为了研究不同类型的视觉特征之间的关系，通过线性加权融合方法将两个分类器的预测得分融合。我们在TRECVID MED11数据集上评估了我们的MED系统，并且实验结果证明了与其他几种最新算法相比，该方法的出色性能。（C）2016 Elsevier B.V.保留所有权利。

著录项

来源
《Neurocomputing》 |2016年第12期|11-18|共8页
作者
Bin Yi; Yang Yang; Shen Fumin; Xu Xing;
展开▼
作者单位

Univ Elect Sci & Technol China, Chengdu, Peoples R China;

Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China;

Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China;

Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Multimedia event detection; Convolutional neural network; Co-training;

机译：多媒体事件检测;卷积神经网络;协同训练;

相似文献

外文文献
中文文献
专利

1. New Trending Events Detection based on the Multi-Representation Index Tree Clustering [J] . Hui Song, Lifeng Wang, Baiyan Li, International Journal of Intelligent Systems and Applications . 2011,第3期

机译：基于多表示索引树聚类的新趋势事件检测
2. Detection of social events in streams of social multimedia [J] . Jonathon Hare, Sina Samangooei, Mahesan Niranjan, International Journal of Multimedia Information Retrieval . 2015,第4期

机译：在社交多媒体流中检测社交事件
3. Real-life events in multimedia: detection, representation, retrieval, and applications [J] . Vasileios Mezaris, Ansgar Scherp, Ramesh Jain, Multimedia Tools and Applications . 2014,第1期

机译：多媒体中的现实事件：检测，表示，检索和应用
4. Enabling Low-Resource Transfer Learning across COVID-19 Corpora by Combining Event-Extraction and Co-Training [C] . Alexander Spangher, Nanyun Peng, Jonathan May, Workshop on NLP for COVID-19 at ACL . 2020

机译：通过结合事件提取和共同培训，在Covid-19 Coress中实现低资源转移学习
5. Kinematic Reconstruction of ttH, H → bb Events at the LHC, and Science Outreach Through Multimedia Blogging [D] . Tan, Shao Min. 2018

机译：大型强子对撞机中ttH，H→bb事件的运动学重构以及通过多媒体博客进行的科学推广
6. Secure Access Control and Large Scale Robust Representation for Online Multimedia Event Detection [O] . Changyu Liu, Bin Lu, Huiling Li -1

机译：在线多媒体事件检测的安全访问控制和大规模鲁棒表示
7. Semi-supervised Learning for Automatic Prosodic Event Detection Using Co-training Algorithm [O] . Je Hun Jeon, Yang Liu 2010

机译：基于协同训练算法的韵律事件自动检测的半监督学习

Combining multi-representation for multimedia event detection using co-training

摘要

著录项

相似文献

相关主题

期刊订阅