Understanding Videos, Constructing Plots: Learning a Visually Grounded Storyline Model from Annotated Videos
Abstract
Analyzing videos of human activities involves not only recognizing actions (typically based on their appearances), but also determining the story/plot of the video. The storyline of a video describes causal relationships between actions. Beyond recognition of individual actions, discovering causal relationships helps to better understand the semantic meaning of the activities. We present an approach to learn a visually grounded storyline model of videos directly from weakly labeled data. The storyline model is represented as an AND-OR graph, a structure that can compactly encode storyline variation across videos. The edges in the AND-OR graph correspond to causal relationships which are represented in terms of spatio-temporal constraints. We formulate an Integer Programming framework for action recognition and storyline extraction using the storyline model and visual groundings learned from training data.
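To make the AND-OR graph idea concrete, here is a minimal sketch of how such a structure can compactly encode several storyline variants. It is not the authors' model: the `Node` class, the `storylines` function, and the action labels are all hypothetical, the structure is simplified to a tree, and the spatio-temporal constraints on edges and the Integer Programming inference step are left out.

```python
from dataclasses import dataclass, field
from itertools import product
from typing import List


@dataclass
class Node:
    """Hypothetical AND-OR node; not taken from the paper."""
    kind: str                      # "LEAF", "AND", or "OR"
    label: str = ""                # action name, used by LEAF nodes only
    children: List["Node"] = field(default_factory=list)


def storylines(node: Node) -> List[List[str]]:
    """Enumerate every action sequence the sub-tree rooted at `node` encodes."""
    if node.kind == "LEAF":
        return [[node.label]]
    if node.kind == "OR":
        # An OR node selects exactly one of its children.
        return [seq for child in node.children for seq in storylines(child)]
    if node.kind == "AND":
        # An AND node requires all of its children, taken in order.
        combos = product(*(storylines(child) for child in node.children))
        return [[action for part in combo for action in part] for combo in combos]
    raise ValueError(f"unknown node kind: {node.kind}")


# Made-up example: a pitch is followed either by (hit, then run) or by a miss.
root = Node("AND", children=[
    Node("LEAF", "pitch"),
    Node("OR", children=[
        Node("AND", children=[Node("LEAF", "hit"), Node("LEAF", "run")]),
        Node("LEAF", "miss"),
    ]),
])

for sequence in storylines(root):
    print(" -> ".join(sequence))
```

Each OR node adds alternatives without duplicating the shared prefix, which is one way to read the abstract's claim that the graph encodes storyline variation compactly.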