...
首页> 外文期刊>JMLR: Workshop and Conference Proceedings >The Gaussian Process Prior VAE for Interpretable Latent Dynamics from Pixels
【24h】

The Gaussian Process Prior VAE for Interpretable Latent Dynamics from Pixels

机译:高斯过程先验VAE用于可解释像素的潜在动力学

获取原文

摘要

We consider the problem of unsupervised learning of a low dimensional, interpretable, latent state of a video containing a moving object. The problem of distilling interpretable dynamics from pixels has been extensively considered through the lens of graphical/state space models (Fraccaro et al., 2017; Lin et al., 2018; Pearce et al., 2018; Chiappa and Paquet, 2019) that exploit Markov structure for cheap computation and structured priors for enforcing interpretability on latent representations. We take a step towards extending these approaches by discarding the Markov structure; inspired by Gaussian process dynamical models (Wang et al., 2006), we instead repurpose the recently proposed Gaussian Process Prior Variational Autoencoder (Casale et al., 2018) for learning interpretable latent dynamics. We describe the model and perform experiments on a synthetic dataset and see that the model reliably reconstructs smooth dynamics exhibiting U-turns and loops. We also observe that this model may be trained without any annealing or freeze-thaw of training parameters in contrast to previous works, albeit for slightly dierent use cases, where application specic training tricks are often required.
机译:我们考虑了无监督学习包含运动对象的视频的低维,可解释,潜在状态的问题。从图形/状态空间模型的角度已经广泛考虑了从像素中提取可解释动态的问题(Fraccaro等人,2017; Lin等人,2018; Pearce等人,2018; Chiappa和Paquet,2019)利用马尔可夫结构进行廉价计算,并利用结构化先验来增强潜在表示的可解释性。我们通过放弃马尔可夫结构来扩展这些方法。受高斯过程动力学模型(Wang等人,2006)的启发,我们改用了最近提出的高斯过程先验变分自动编码器(Casale等人,2018)来学习可解释的潜在动力学。我们描述了该模型并在一个综合数据集上进行了实验,发现该模型可靠地重建了呈现U型转弯和回旋的平滑动力学。我们还观察到,与以前的工作相比,该模型可以在没有任何退火或冻融训练参数的情况下进行训练,尽管对于使用情况稍有不同的用例,在这种情况下通常需要应用特定的训练技巧。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号