首页> 外文会议>German Conference on Pattern Recognition >Temporal Interpolation as an Unsupervised Pretraining Task for Optical Flow Estimation
【24h】

Temporal Interpolation as an Unsupervised Pretraining Task for Optical Flow Estimation

机译:时间插值作为光流估计的无监督预训练任务

获取原文

摘要

The difficulty of annotating training data is a major obstacle to using CNNs for low-level tasks in video. Synthetic data often does not generalize to real videos, while unsupervised methods require heuristic losses. Proxy tasks can overcome these issues, and start by training a network for a task for which annotation is easier or which can be trained unsupervised. The trained network is then fine-tuned for the original task using small amounts of ground truth data. Here, we investigate frame interpolation as a proxy task for optical flow. Using real movies, we train a CNN unsupervised for temporal interpolation. Such a network implicitly estimates motion, but cannot handle untextured regions. By fine-tuning on small amounts of ground truth flow, the network can learn to fill in homogeneous regions and compute full optical flow fields. Using this unsupervised pre-training, our network outperforms similar architectures that were trained supervised using synthetic optical flow.
机译:注释训练数据的困难是将CNN用于视频中的低级任务的主要障碍。合成数据通常不能推广到真实视频,而无监督方法则需要启发式损失。代理任务可以克服这些问题,并可以从培训网络开始以完成注释更容易或可以不受监督地进行培训的任务。然后,使用少量的地面真实数据对受过训练的网络进行微调以完成原始任务。在这里,我们研究帧插值作为光流的代理任务。使用真实电影,我们训练了无监督的CNN进行时间插值。这样的网络隐式地估计运动,但是不能处理无纹理的区域。通过微调少量地面实况流,网络可以学习填充均匀区域并计算完整的光流场。使用这种无监督的预训练,我们的网络优于使用合成光流进行监督训练的类似体系结构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号