首页> 外文期刊>Expert Systems with Application >Multimodal and ontology-based fusion approaches of audio and visual processing for violence detection in movies
【24h】

Multimodal and ontology-based fusion approaches of audio and visual processing for violence detection in movies

机译:电影中暴力检测的视听处理的多模式和基于本体的融合方法

获取原文
获取原文并翻译 | 示例

摘要

In this paper we present our research results towards the detection of violent scenes in movies, employing advanced fusion methodologies, based on learning, knowledge representation and reasoning. Towards this goal, a multi-step approach is followed: initially, automated audio and visual analysis is performed to extract audio and visual cues. Then, two different fusion approaches are deployed: (i) a multimodal one that provides binary decisions on the existence of violence or not, employing machine learning techniques, (ii) an ontological and reasoning one, that combines the audio-visual cues with violence and multimedia ontologies. The latter reasons out not only the existence of violence or not in a video scene, but also the type of violence (fight, screams, gunshots). Both approaches are experimentally tested, validated and compared for the binary decision problem of violence detection. Finally, results for the violence type identification are presented for the ontological fusion approach. For evaluation purposes, a large dataset of real movie data has been populated.
机译:在本文中,我们介绍了基于先进的融合方法,基于学习,知识表示和推理的电影中暴力场景检测的研究成果。为了实现这一目标,我们采取了多步骤方法:首先,执行自动音频和视频分析以提取音频和视频提示。然后,部署了两种不同的融合方法:(i)多模式方法,使用机器学习技术对是否存在暴力行为提供二元决策,(ii)本体论和推理方法,将视听线索与暴力行为相结合和多媒体本体。后者不仅说明在视频场景中是否存在暴力,而且还说明了暴力的类型(战斗,尖叫,枪声)。对于暴力检测的二元决策问题,这两种方法均经过实验测试,验证和比较。最后,针对本体融合方法给出了暴力类型识别的结果。为了评估的目的,已经填充了真实电影数据的大型数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号