Scene-Centric Joint Parsing of Cross-View Videos

Abstract

Cross-view video understanding is an important yet under-explored area in computer vision. In this paper, we introduce a joint parsing framework that integrates view-centric proposals into scene-centric parse graphs that represent a coherent scene-centric understanding of cross-view scenes. Our key observations are that overlapping fields of views embed rich appearance and geometry correlations and that knowledge fragments corresponding to individual vision tasks are governed by consistency constraints available in commonsense knowledge. The proposed joint parsing framework represents such correlations and constraints explicitly and generates semantic scene-centric parse graphs. Quantitative experiments show that scene-centric predictions in the parse graph outperform view-centric predictions.
