首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images
【24h】

Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images

机译:牛顿图像理解:展现静态图像中对象的动态

获取原文

摘要

In this paper, we study the challenging problem of predicting the dynamics of objects in static images. Given a query object in an image, our goal is to provide a physical understanding of the object in terms of the forces acting upon it and its long term motion as response to those forces. Direct and explicit estimation of the forces and the motion of objects from a single image is extremely challenging. We define intermediate physical abstractions called Newtonian scenarios and introduce Newtonian Neural Network (N3) that learns to map a single image to a state in a Newtonian scenario. Our evaluations show that our method can reliably predict dynamics of a query object from a single image. In addition, our approach can provide physical reasoning that supports the predicted dynamics in terms of velocity and force vectors. To spur research in this direction we compiled Visual Newtonian Dynamics (VIND) dataset that includes more than 6000 videos aligned with Newtonian scenarios represented using game engines, and more than 4500 still images with their ground truth dynamics.
机译:在本文中,我们研究了预测静态图像中对象动态的挑战性问题。给定图像中的查询对象,我们的目标是根据作用在对象上的力及其对这些力的响应的长期运动来提供对对象的物理理解。从单个图像直接和显式估计对象的力和运动是极具挑战性的。我们定义了称为牛顿场景的中间物理抽象,并介绍了学会将单个图像映射到牛顿场景中的状态的牛顿神经网络(N3)。我们的评估表明,我们的方法可以从单个图像可靠地预测查询对象的动态。另外,我们的方法可以提供物理推理,从而支持速度和力矢量方面的预测动力学。为了推动这一方向的研究,我们编译了Visual Newtonian Dynamics(VIND)数据集,其中包括6000多个与使用游戏引擎表示的Newtonian场景对齐的视频,以及4500多个具有基本真实动态的静态图像。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号