...
首页> 外文期刊>ACM Transactions on Graphics >Light Field Video Capture Using a Learning-Based Hybrid Imaging System
【24h】

Light Field Video Capture Using a Learning-Based Hybrid Imaging System

机译:使用基于学习的混合成像系统进行光场视频捕获

获取原文
获取原文并翻译 | 示例
           

摘要

Light field cameras have many advantages over traditional cameras, as they allow the user to change various camera settings after capture. However, capturing light fields requires a huge bandwidth to record the data: a modern light field camera can only take three images per second. This prevents current consumer light field cameras from capturing light field videos. Temporal interpolation at such extreme scale (10x, from 3 fps to 30 fps) is infeasible as too much information will be entirely missing between adjacent frames. Instead, we develop a hybrid imaging system, adding another standard video camera to capture the temporal information. Given a 3 fps light field sequence and a standard 30 fps 2D video, our system can then generate a full light field video at 30 fps.We adopt a learning-based approach, which can be decomposed into two steps: spatio-temporal flow estimation and appearance estimation. The flow estimation propagates the angular information from the light field sequence to the 2D video, so we can warp input images to the target view. The appearance estimation then combines these warped images to output the final pixels. The whole process is trained end-to-end using convolutional neural networks. Experimental results demonstrate that our algorithm outperforms current video interpolation methods, enabling consumer light field videography, and making applications such as refocusing and parallax view generation achievable on videos for the first time.
机译:与传统相机相比,光场相机具有许多优势,因为它们允许用户在捕获后更改各种相机设置。但是,捕获光场需要巨大的带宽才能记录数据:现代的光场相机每秒只能拍摄三幅图像。这防止了当前的消费者光场照相机捕获光场视频。如此极端的时间插值(从3 fps到30 fps为10倍)的时间插值是不可行的,因为在相邻帧之间将完全丢失太多信息。相反,我们开发了一种混合成像系统,添加了另一个标准摄像机来捕获时间信息。给定一个3 fps的光场序列和一个标准的30 fps的2D视频,我们的系统可以生成30 fps的全光场视频。我们采用基于学习的方法,可以分解为两个步骤:时空流量估算和外观估算。流量估计会将角度信息从光场序列传播到2D视频,因此我们可以将输入图像扭曲到目标视图。然后,外观估计将这些变形的图像进行组合以输出最终像素。使用卷积神经网络对整个过程进行端到端训练。实验结果表明,我们的算法优于目前的视频插值方法,可实现用户光场摄像,并使视频首次实现诸如重新聚焦和视差视图生成等应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号