首页> 美国卫生研究院文献>other >Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction
【2h】

Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction

机译:情感视频检索:通过大规模分段特征提取在好莱坞电影中进行暴力检测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Without doubt general video and sound, as found in large multimedia archives, carry emotional information. Thus, audio and video retrieval by certain emotional categories or dimensions could play a central role for tomorrow's intelligent systems, enabling search for movies with a particular mood, computer aided scene and sound design in order to elicit certain emotions in the audience, etc. Yet, the lion's share of research in affective computing is exclusively focusing on signals conveyed by humans, such as affective speech. Uniting the fields of multimedia retrieval and affective computing is believed to lend to a multiplicity of interesting retrieval applications, and at the same time to benefit affective computing research, by moving its methodology “out of the lab” to real-world, diverse data. In this contribution, we address the problem of finding “disturbing” scenes in movies, a scenario that is highly relevant for computer-aided parental guidance. We apply large-scale segmental feature extraction combined with audio-visual classification to the particular task of detecting violence. Our system performs fully data-driven analysis including automatic segmentation. We evaluate the system in terms of mean average precision (MAP) on the official data set of the MediaEval 2012 evaluation campaign's Affect Task, which consists of 18 original Hollywood movies, achieving up to .398 MAP on unseen test data in full realism. An in-depth analysis of the worth of individual features with respect to the target class and the system errors is carried out and reveals the importance of peak-related audio feature extraction and low-level histogram-based video analysis.
机译:毫无疑问,大型多媒体档案中的常规视频和声音可以传达情感信息。因此,按特定情感类别或维度进行音频和视频检索对于明天的智能系统可以发挥核心作用,从而可以搜索具有特定心情的电影,计算机辅助场景和声音设计,从而在观众中引起某些情感,等等。 ,情感计算方面的研究最多只专注于人类传达的信号,例如情感语音。相信将多媒体检索和情感计算的领域结合起来,可以带来许多有趣的检索应用程序,同时通过将其方法“从实验室外”转移到现实世界中的各种数据,可以使情感计算研究受益。在这项贡献中,我们解决了在电影中发现“令人不安”的场景的问题,这种场景与计算机辅助的父母指导高度相关。我们将大规模分段特征提取与视听分类相结合,用于检测暴力的特定任务。我们的系统执行完全由数据驱动的分析,包括自动分段。我们根据MediaEval 2012评估活动的“情感任务”的官方数据集对系统进行平均平均精度(MAP)评估,该任务由18部好莱坞原始电影组成,在完全真实的情况下,对看不见的测试数据可达到0.398 MAP。进行了针对目标类别和系统错误的单个功能价值的深入分析,并揭示了与峰值相关的音频特征提取和基于低级直方图的视频分析的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号