Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks

Xue Tianfan; Wu Jiajun; Bouman Katherine L.; Freeman William T.

首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks

【24h】

Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks

机译：视觉动力学：通过分层交叉卷积网络的随机未来生成

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We study the problem of synthesizing a number of likely future frames from a single input image. In contrast to traditional methods that have tackled this problem in a deterministic or non-parametric way, we propose to model future frames in a probabilistic manner. Our probabilistic model makes it possible for us to sample and synthesize many possible future frames from a single input image. To synthesize realistic movement of objects, we propose a novel network structure, namely a Cross Convolutional Network; this network encodes image and motion information as feature maps and convolutional kernels, respectively. In experiments, our model performs well on synthetic data, such as 2D shapes and animated game sprites, and on real-world video frames. We present analyses of the learned network representations, showing it is implicitly learning a compact encoding of object appearance and motion. We also demonstrate a few of its applications, including visual analogy-making and video extrapolation.

机译：我们研究了从单个输入图像合成许多可能的未来帧的问题。与以确定性或非参数方式解决此问题的传统方法相反，我们建议以概率方式对未来框架进行建模。我们的概率模型使我们有可能从单个输入图像中采样和合成许多可能的未来帧。为了合成物体的真实运动，我们提出了一种新颖的网络结构，即交叉卷积网络；该网络将图像和运动信息分别编码为特征图和卷积核。在实验中，我们的模型在合成数据（例如2D形状和动画游戏图片）以及真实视频帧上的表现良好。我们对学习到的网络表示形式进行分析，表明它隐式地学习了对象外观和运动的紧凑编码。我们还将展示其一些应用，包括视觉类比制作和视频外推。

著录项

来源
《IEEE Transactions on Pattern Analysis and Machine Intelligence》 |2019年第9期|2236-2250|共15页
作者
Xue Tianfan; Wu Jiajun; Bouman Katherine L.; Freeman William T.;
展开▼
作者单位

Google Inc Mountain View CA 94043 USA;

MIT Dept Elect Engn & Comp Sci Cambridge MA 02139 USA;

Harvard Univ Cambridge MA 02138 USA;

Google Inc Mountain View CA 94043 USA|MIT Dept Elect Engn & Comp Sci Cambridge MA 02139 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
future prediction; frame synthesis; probabilistic modeling; convolutional networks; cross convolution;

机译：未来的预测;帧合成;概率建模卷积网络交叉卷积;

相似文献

外文文献
中文文献
专利

1. Guest Editorial: Cross-Layer Design for Future Generation Wireless Networks [J] . Yuh-Shyan Chen, Athanasios V. Vasilakos, Chien-Chung Shen Wireless Personal Communications . 2009,第3期

机译：客座社论：下一代无线网络的跨层设计
2. Controlling Diffusive Network Dynamics using a Stochastically-Mobile Sensor-Actuator Platform ? [J] . Amirkhosro Vosughi, Mengran Xue, Sandip Roy IFAC PapersOnLine . 2019,第20期

机译：使用随机移动传感器致动器平台控制扩散网络动态？
3. Convolutional neural networks based on multi-scale additive merging layers for visual smoke recognition [J] . Yuan Feiniu, Zhang Lin, Wan Boyang, Machine Vision and Applications . 2019,第2期

机译：基于多尺度累加合并层的卷积神经网络用于视觉烟雾识别
4. Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks [C] . Tianfan Xue, Jiajun Wu, Katherine L. Bouman, Annual conference on Neural Information Processing Systems . 2016

机译：视觉动力学：通过交叉卷积网络的概率未来框架合成
5. Stochastic control for cross-layer resource provisioning in next generation wireless and ad hoc networks. [D] . Li, Anfei. 2008

机译：下一代无线和自组织网络中用于跨层资源供应的随机控制。
6. Stochastic Selection of Activation Layers for Convolutional Neural Networks [O] . Loris Nanni, Alessandra Lumini, Stefano Ghidoni, 2020

机译：卷积神经网络激活层的随机选择
7. Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks [O] . Tianfan Xue, Jiajun Wu, Katherine L. Bouman, 2019

机译：视觉动态：通过分层交叉卷积网络的随机未来生成
8. Cross-Layer Resource Allocation for Wireless Visual Sensor Networks and Mobile Ad Hoc Networks. [R] . Kondi, L. P. 2014

机译：无线视觉传感器网络和移动ad Hoc网络的跨层资源分配。

Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅