...
首页> 外文期刊>IEEE transactions on multimedia >Discovering Latent Discriminative Patterns for Multi-Mode Event Representation
【24h】

Discovering Latent Discriminative Patterns for Multi-Mode Event Representation

机译:发现多模式事件表示的潜在鉴别模式

获取原文
获取原文并翻译 | 示例
           

摘要

Representation of videos is essential since it conveys an understanding of video content and enables many higher level tasks to be tackled efficiently. However, it is challenging to propose a rational representation for complex event videos, as most video information is either noisy or redundant. In this paper, we propose a compact event representation method that can concisely describe the inner modes of events. We deem that an optimal event representation scheme should reflect the long-term and high-level visual semantics (visual topics) of events, so different from previous frame-level video semantics representation methods and concept-based video representation methods, we investigate the problem from the perspective of segment-level video representations. We then present three appealing properties of segment-level visual semantics. Based on the observation, we propose different algorithms that rely on a novel deep-visual-word-based video encoding method to discover latent discriminative patterns of events. Finally, our multi-mode event representation is obtained by concatenating the discovered patterns as inner modes. We adopt our event representation for representative event parts mining, which can highlight the visual topics of events and remarkably prune the raw videos. We validate our event representation method based on complex event detection task. Experimental results on two standard benchmarking datasets, MED11 and CCV Dataset, show that the proposed method can significantly outperform the state-of-the-art approaches.
机译:视频的表示是必不可少的,因为它传达了对视频内容的理解,并且能够有效地解决许多更高的级别任务。然而,提出复杂事件视频的合理表示是挑战,因为大多数视频信息都是嘈杂的或冗余。在本文中,我们提出了一种紧凑的事件表示方法,可以简明地描述事件的内部模式。我们认为,最佳事件表示方案应反映事件的长期和高级视觉语义(视觉主题),与之前的帧级视频语义表示方法和基于概念的视频表示方法不同,我们调查了这个问题从段级视频表示的角度来看。然后我们提出了三个段级视觉语义的吸引人的属性。基于观察,我们提出了不同的算法依赖于基于新的深视网型的视频编码方法来发现潜在的事件模式。最后,通过将发现的模式连接为内模式来获得我们的多模式事件表示。我们通过我们的活动代表代表事件零件挖掘,可以突出显示事件的视觉主题,并显着修剪原始视频。基于复杂事件检测任务,我们验证了我们的事件表示方法。两个标准基准数据集,MED11和CCV数据集的实验结果表明,该方法可以显着优于最先进的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号