...
首页> 外文期刊>IEEE transactions on multimedia >Discovering Latent Discriminative Patterns for Multi-Mode Event Representation
【24h】

Discovering Latent Discriminative Patterns for Multi-Mode Event Representation

机译:发现多模式事件表示的潜在判别模式

获取原文
获取原文并翻译 | 示例
           

摘要

Representation of videos is essential since it conveys an understanding of video content and enables many higher level tasks to be tackled efficiently. However, it is challenging to propose a rational representation for complex event videos, as most video information is either noisy or redundant. In this paper, we propose a compact event representation method that can concisely describe the inner modes of events. We deem that an optimal event representation scheme should reflect the long-term and high-level visual semantics (visual topics) of events, so different from previous frame-level video semantics representation methods and concept-based video representation methods, we investigate the problem from the perspective of segment-level video representations. We then present three appealing properties of segment-level visual semantics. Based on the observation, we propose different algorithms that rely on a novel deep-visual-word-based video encoding method to discover latent discriminative patterns of events. Finally, our multi-mode event representation is obtained by concatenating the discovered patterns as inner modes. We adopt our event representation for representative event parts mining, which can highlight the visual topics of events and remarkably prune the raw videos. We validate our event representation method based on complex event detection task. Experimental results on two standard benchmarking datasets, MED11 and CCV Dataset, show that the proposed method can significantly outperform the state-of-the-art approaches.
机译:视频表示是必不可少的,因为它传达了对视频内容的理解,并使许多更高层次的任务得以有效解决。然而,由于大多数视频信息要么是嘈杂的,要么是多余的,因此为复杂事件视频提出合理的表示是一项挑战。在本文中,我们提出了一种紧凑的事件表示方法,可以简洁地描述事件的内部模式。我们认为最佳的事件表示方案应该反映事件的长期和高级视觉语义(视觉主题),因此与以前的帧级视频语义表示方法和基于概念的视频表示方法不同,我们研究了该问题。从段级视频表示的角度来看。然后,我们介绍了段级视觉语义的三个吸引人的属性。基于观察,我们提出了不同的算法,这些算法依赖于基于深度视觉词的新型视频编码方法来发现潜在的事件判别模式。最后,通过将发现的模式连接为内部模式来获得我们的多模式事件表示。我们采用事件表示法进行代表性事件部分的挖掘,可以突出事件的视觉主题并显着修剪原始视频。我们验证了基于复杂事件检测任务的事件表示方法。在两个标准基准数据集MED11和CCV数据集上的实验结果表明,所提出的方法可以显着优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号