Evaluating multimedia features and fusion for example-based event detection

Gregory K. Myers; Ramesh Nallapati; Julien van Hout; Stephanie Pancoast; Ramakant Nevatia; Chen Sun; Amirhossein Habibian; Dennis C. Koelma; Koen E. A. van de Sande; Arnold W. M. Smeulders; Cees G. M. Snoek

首页> 外文期刊>Machine Vision and Applications >Evaluating multimedia features and fusion for example-based event detection

【24h】

Evaluating multimedia features and fusion for example-based event detection

机译：评估多媒体功能和融合以进行基于示例的事件检测

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multimedia event detection (MED) is a challenging problem because of the heterogeneous content and variable quality found in large collections of Internet videos. To study the value of multimedia features and fusion for representing and learning events from a set of example video clips, we created SESAME, a system for video SEarch with Speed and Accuracy for Multimedia Events. SESAME includes multiple bag-of-words event classifiers based on single data types: low-level visual, motion, and audio features; high-level semantic visual concepts; and automatic speech recognition. Event detection performance was evaluated for each event classifier. The performance of low-level visual and motion features was improved by the use of difference coding. The accuracy of the visual concepts was nearly as strong as that of the low-level visual features. Experiments with a number of fusion methods for combining the event detection scores from these classifiers revealed that simple fusion methods, such as arithmetic mean, perform as well as or better than other, more complex fusion methods. SESAME's performance in the 2012 TRECVID MED evaluation was one of the best reported.

机译：多媒体事件检测（MED）是一个具有挑战性的问题，因为在大量Internet视频中发现了内容的异质性和可变的质量。为了研究多媒体功能和融合对于从一组示例视频剪辑中表示和学习事件的价值，我们创建了SESAME，这是一种用于视频搜索的系统，具有针对多媒体事件的速度和准确性。 SESAME包括基于单个数据类型的多个词袋事件分类器：低级视觉，运动和音频功能；高级语义视觉概念；和自动语音识别。针对每个事件分类器评估了事件检测性能。通过使用差异编码，可以改善低级视觉和运动功能的性能。视觉概念的准确性几乎与低级视觉特征的准确性一样强。使用多种融合方法对来自这些分类器的事件检测分数进行组合的实验表明，简单的融合方法（例如算术平均值）与其他更复杂的融合方法相比，性能更好。 SESAME在2012年TRECVID MED评估中的表现是最好的报告之一。

著录项

来源
《Machine Vision and Applications》 |2014年第1期|17-32|共16页
作者
Gregory K. Myers; Ramesh Nallapati; Julien van Hout; Stephanie Pancoast; Ramakant Nevatia; Chen Sun; Amirhossein Habibian; Dennis C. Koelma; Koen E. A. van de Sande; Arnold W. M. Smeulders; Cees G. M. Snoek;
展开▼
作者单位

SRI International (SRI), 333 Ravenswood Avenue, Menlo Park, CA 94025, USA;

SRI International (SRI), 333 Ravenswood Avenue, Menlo Park, CA 94025, USA IBM Thomas J Watson Research Center, 1101 Kitchawan Rd,Yorktown Heights, NY 10598, USA;

SRI International (SRI), 333 Ravenswood Avenue, Menlo Park, CA 94025, USA;

SRI International (SRI), 333 Ravenswood Avenue, Menlo Park, CA 94025, USA;

Institute for Robotics and Intelligent Systems, University of Southern California (USC), Los Angeles, CA 90089-0273, USA;

Institute for Robotics and Intelligent Systems, University of Southern California (USC), Los Angeles, CA 90089-0273, USA;

University of Amsterdam (UvA), Science Park 904, P.O. Box 94323, Amsterdam 1098 GH, The Netherlands;

University of Amsterdam (UvA), Science Park 904, P.O. Box 94323, Amsterdam 1098 GH, The Netherlands;

University of Amsterdam (UvA), Science Park 904, P.O. Box 94323, Amsterdam 1098 GH, The Netherlands;

University of Amsterdam (UvA), Science Park 904, P.O. Box 94323, Amsterdam 1098 GH, The Netherlands;

University of Amsterdam (UvA), Science Park 904, P.O. Box 94323, Amsterdam 1098 GH, The Netherlands;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Multimedia event detection; Video retrieval; Content extraction; Difference coding; Late fusion;

机译：多媒体事件检测;视频检索;内容提取;差异编码;后期融合;

相似文献

外文文献
中文文献
专利

1. Multimedia event detection with multimodal feature fusion and temporal concept localization [J] . Sangmin Oh, Scott McCloskey, Ilseo Kim, Machine Vision and Applications . 2014,第1期

机译：具有多模式特征融合和时间概念定位的多媒体事件检测
2. Multimedia classification and event detection using double fusion [J] . Zhen-zhong Lan, Lei Bao, Shoou-I Yu, Multimedia Tools and Applications . 2014,第1期

机译：使用双重融合的多媒体分类和事件检测
3. Example-based image colorization via automatic feature selection and fusion [J] . Li Bo, Lai Yu-Kun, Rosin Paul L. Neurocomputing . 2017,第nova29期

机译：通过自动特征选择和融合实现基于示例的图像着色
4. Improving Detection of Acoustic Events Using Audiovisual Data and Feature Level Fusion [C] . T. Butko, C. Canton-Ferrer, C. Segura, International Speech Communication Association . 2009

机译：使用视听数据和特征级别融合改善声学事件的检测
5. Signal Fusion and Semantic Similarity Evaluation for Social Media Based Adverse Drug Event Detection [D] . Khaja, Hameeduddin Irfan. 2018

机译：基于社交媒体的不良药物事件检测信号融合与语义相似性评估
6. Mental fatigue level detection based on event related and visual evoked potentials features fusion in virtual indoor environment [O] . Hachem A. Lamti, Mohamed Moncef Ben Khelifa, Vincent Hugel 2019

机译：基于事件相关和视觉诱发电位的心理疲劳水平检测在虚拟室内环境中融合特征
7. Evaluating Multimedia Features and Fusion for Example-Based Event Detection [O] . Myers, G.K., Nallapati, R., van Hout, J., 2014

机译：评估多媒体功能和融合以进行基于示例的事件检测

Evaluating multimedia features and fusion for example-based event detection

摘要

著录项

相似文献

相关主题

期刊订阅