...
首页> 外文期刊>Machine Vision and Applications >Evaluating multimedia features and fusion for example-based event detection
【24h】

Evaluating multimedia features and fusion for example-based event detection

机译:评估多媒体功能和融合以进行基于示例的事件检测

获取原文
获取原文并翻译 | 示例
           

摘要

Multimedia event detection (MED) is a challenging problem because of the heterogeneous content and variable quality found in large collections of Internet videos. To study the value of multimedia features and fusion for representing and learning events from a set of example video clips, we created SESAME, a system for video SEarch with Speed and Accuracy for Multimedia Events. SESAME includes multiple bag-of-words event classifiers based on single data types: low-level visual, motion, and audio features; high-level semantic visual concepts; and automatic speech recognition. Event detection performance was evaluated for each event classifier. The performance of low-level visual and motion features was improved by the use of difference coding. The accuracy of the visual concepts was nearly as strong as that of the low-level visual features. Experiments with a number of fusion methods for combining the event detection scores from these classifiers revealed that simple fusion methods, such as arithmetic mean, perform as well as or better than other, more complex fusion methods. SESAME's performance in the 2012 TRECVID MED evaluation was one of the best reported.
机译:多媒体事件检测(MED)是一个具有挑战性的问题,因为在大量Internet视频中发现了内容的异质性和可变的质量。为了研究多媒体功能和融合对于从一组示例视频剪辑中表示和学习事件的价值,我们创建了SESAME,这是一种用于视频搜索的系统,具有针对多媒体事件的速度和准确性。 SESAME包括基于单个数据类型的多个词袋事件分类器:低级视觉,运动和音频功能;高级语义视觉概念;和自动语音识别。针对每个事件分类器评估了事件检测性能。通过使用差异编码,可以改善低级视觉和运动功能的性能。视觉概念的准确性几乎与低级视觉特征的准确性一样强。使用多种融合方法对来自这些分类器的事件检测分数进行组合的实验表明,简单的融合方法(例如算术平均值)与其他更复杂的融合方法相比,性能更好。 SESAME在2012年TRECVID MED评估中的表现是最好的报告之一。

著录项

  • 来源
    《Machine Vision and Applications》 |2014年第1期|17-32|共16页
  • 作者单位

    SRI International (SRI), 333 Ravenswood Avenue, Menlo Park, CA 94025, USA;

    SRI International (SRI), 333 Ravenswood Avenue, Menlo Park, CA 94025, USA IBM Thomas J Watson Research Center, 1101 Kitchawan Rd,Yorktown Heights, NY 10598, USA;

    SRI International (SRI), 333 Ravenswood Avenue, Menlo Park, CA 94025, USA;

    SRI International (SRI), 333 Ravenswood Avenue, Menlo Park, CA 94025, USA;

    Institute for Robotics and Intelligent Systems, University of Southern California (USC), Los Angeles, CA 90089-0273, USA;

    Institute for Robotics and Intelligent Systems, University of Southern California (USC), Los Angeles, CA 90089-0273, USA;

    University of Amsterdam (UvA), Science Park 904, P.O. Box 94323, Amsterdam 1098 GH, The Netherlands;

    University of Amsterdam (UvA), Science Park 904, P.O. Box 94323, Amsterdam 1098 GH, The Netherlands;

    University of Amsterdam (UvA), Science Park 904, P.O. Box 94323, Amsterdam 1098 GH, The Netherlands;

    University of Amsterdam (UvA), Science Park 904, P.O. Box 94323, Amsterdam 1098 GH, The Netherlands;

    University of Amsterdam (UvA), Science Park 904, P.O. Box 94323, Amsterdam 1098 GH, The Netherlands;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Multimedia event detection; Video retrieval; Content extraction; Difference coding; Late fusion;

    机译:多媒体事件检测;视频检索;内容提取;差异编码;后期融合;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号