首页> 外文OA文献 >Understanding Videos, Constructing Plots Learning a Visually Grounded Storyline Model from Annotated Videos
【2h】

Understanding Videos, Constructing Plots Learning a Visually Grounded Storyline Model from Annotated Videos

机译:了解视频,构建剧情从带注释的视频中学习基于视觉的故事情节模型

摘要

Analyzing videos of human activities involves not only recognizing actions (typically based on their appearances), but also determining the story/plot of the video. The storyline of a video describes causal relationships between actions. Beyond recognition of individual actions, discovering causal relationships helps to better understand the semantic meaning of the activities. We present an approach to learn a visually grounded storyline model of videos directly from weakly labeled data. The storyline model is represented as an AND-OR graph, a structure that can compactly encode storyline variation across videos. The edges in the AND-OR graph correspond to causal relationships which are represented in terms of spatio-temporal constraints. We formulate an Integer Programming framework for action recognition and storyline extraction using the storyline model and visual groundings learned from training data.
机译:分析人类活动的视频不仅涉及识别动作(通常基于其外观),而且还涉及确定视频的故事/情节。视频的故事情节描述了动作之间的因果关系。除了识别单个动作之外,发现因果关系还有助于更好地理解活动的语义。我们提出了一种直接从弱标签数据中学习基于视觉的故事情节模型的方法。故事情节模型以AND-OR图表示,该结构可以紧凑地编码视频中故事情节的变化。 AND-OR图中的边对应于因果关系,这些因果关系以时空约束表示。我们使用故事情节模型和从训练数据中学到的视觉基础,为动作识别和故事情节提取制定了一个整数编程框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号