首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation
【24h】

S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation

机译:S3VAE:用于表示解开和数据生成的自我监督顺序VAE

获取原文

摘要

We propose a sequential variational autoencoder to learn disentangled representations of sequential data (e.g., videos and audios) under self-supervision. Specifically, we exploit the benefits of some readily accessible supervision signals from input data itself or some off-the-shelf functional models and accordingly design auxiliary tasks for our model to utilize these signals. With the supervision of the signals, our model can easily disentangle the representation of an input sequence into static factors and dynamic factors (i.e., time-invariant and time-varying parts). Comprehensive experiments across videos and audios verify the effectiveness of our model on representation disentanglement and generation of sequential data, and demonstrate that, our model with self-supervision performs comparable to, if not better than, the fully-supervised model with ground truth labels, and outperforms state-of-the-art unsupervised models by a large margin.
机译:我们提出了一种顺序变分自动编码器,以学习在自我监督下顺序数据(例如视频和音频)的解缠表示。具体而言,我们从输入数据本身或某些现成的功能模型中利用了一些易于访问的监管信号的优势,并因此为模型设计了辅助任务以利用这些信号。在信号的监督下,我们的模型可以轻松地将输入序列的表示分解为静态因子和动态因子(即时不变和时变部分)。跨视频和音频进行的全面实验验证了我们的模型在表征解开和顺序数据生成方面的有效性,并表明,具有自我监督功能的我们的模型的性能与具有地面真实性标签的完全监督的模型相当,甚至更好。并大大超越了最新的无监督模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号