Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images

机译：牛顿图像理解：展现静态图像中对象的动态

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we study the challenging problem of predicting the dynamics of objects in static images. Given a query object in an image, our goal is to provide a physical understanding of the object in terms of the forces acting upon it and its long term motion as response to those forces. Direct and explicit estimation of the forces and the motion of objects from a single image is extremely challenging. We define intermediate physical abstractions called Newtonian scenarios and introduce Newtonian Neural Network (N3) that learns to map a single image to a state in a Newtonian scenario. Our evaluations show that our method can reliably predict dynamics of a query object from a single image. In addition, our approach can provide physical reasoning that supports the predicted dynamics in terms of velocity and force vectors. To spur research in this direction we compiled Visual Newtonian Dynamics (VIND) dataset that includes more than 6000 videos aligned with Newtonian scenarios represented using game engines, and more than 4500 still images with their ground truth dynamics.

机译：在本文中，我们研究了预测静态图像中对象动态的挑战性问题。给定图像中的查询对象，我们的目标是根据作用在对象上的力及其对这些力的响应的长期运动来提供对对象的物理理解。从单个图像直接和显式估计对象的力和运动是极具挑战性的。我们定义了称为牛顿场景的中间物理抽象，并介绍了学会将单个图像映射到牛顿场景中的状态的牛顿神经网络（N3）。我们的评估表明，我们的方法可以从单个图像可靠地预测查询对象的动态。另外，我们的方法可以提供物理推理，从而支持速度和力矢量方面的预测动力学。为了推动这一方向的研究，我们编译了Visual Newtonian Dynamics（VIND）数据集，其中包括6000多个与使用游戏引擎表示的Newtonian场景对齐的视频，以及4500多个具有基本真实动态的静态图像。

著录项

来源
《IEEE Conference on Computer Vision and Pattern Recognition》|2016年|3521-3529|共9页
会议地点
作者
Roozbeh Mottaghi; Hessam Bagherinezhad; Mohammad Rastegari; Ali Farhadi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Dynamics; Videos; Visualization; Games; Engines; Force; Cognition;

机译：动力学;视频;可视化;游戏;引擎;力;认知;
入库时间 2022-08-26 13:47:05

相似文献

外文文献
中文文献
专利

1. Understanding dynamic and static displays: using images to reason dynamically [J] . Sally Bogacz, J. Gregory Trafton Cognitive Systems Research . 2005,第1a4期

机译：了解动态和静态显示：使用图像进行动态推理
2. Operational Automatic Remote Sensing Image Understanding Systems: Beyond Geographic Object-Based and Object-Oriented Image Analysis (GEOBIA/GEOOIA). Part 2: Novel system Architecture, Information/Knowledge Representation, Algorithm Design and Implementation [J] . Andrea Baraldi, Luigi Boschetti Remote Sensing . 2012,第9期

机译：可操作的自动遥感影像理解系统：超越基于地理对象和面向对象的图像分析（GEOBIA / GEOOIA）。第2部分：新颖的系统架构，信息/知识表示，算法设计和实现
3. Operational Automatic Remote Sensing Image Understanding Systems: Beyond Geographic Object-Based and Object-Oriented Image Analysis (GEOBIA/GEOOIA). Part 2: Novel system Architecture, Information/Knowledge Representation, Algorithm Design and Implementation [J] . Andrea Baraldi, Luigi Boschetti Remote Sensing . 2012,第9期

机译：可操作的自动遥感影像理解系统：超越基于地理对象和面向对象的图像分析（GEOBIA / GEOOIA）。第2部分：新颖的系统架构，信息/知识表示，算法设计和实现
4. Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images [C] . Roozbeh Mottaghi, Hessam Bagherinezhad, Mohammad Rastegari, IEEE Conference on Computer Vision and Pattern Recognition . 2016

机译：牛顿图像理解：展开静态图像中对象的动态
5. Towards Object-Level Image Understanding: Detecting Objects of Interest from Images [D] . Shen, Xiaohui 2013

机译：迈向对象级图像理解：从图像中检测感兴趣的对象
6. Charge Transfer in Dynamical Biosystems or The Treacheryof (Static) Images [O] . David N. Beratan, *, Chaoren Liu, -1

机译：动态生物系统中的电荷转移（静态）图片的
7. Automated Diagnosis And Image Understanding With Object Extraction, Object Classification, And Inferencing In Retinal Images [O] . Michael Goldbaum, Saied Moezzi, Adam Taylor, 1996

机译：使用视网膜图像中的对象提取，对象分类和推理进行自动诊断和图像理解

Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images

摘要

著录项

相似文献

相关主题

期刊订阅