S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation

机译：S3VAE：用于表示解开和数据生成的自我监督顺序VAE

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We propose a sequential variational autoencoder to learn disentangled representations of sequential data (e.g., videos and audios) under self-supervision. Specifically, we exploit the benefits of some readily accessible supervision signals from input data itself or some off-the-shelf functional models and accordingly design auxiliary tasks for our model to utilize these signals. With the supervision of the signals, our model can easily disentangle the representation of an input sequence into static factors and dynamic factors (i.e., time-invariant and time-varying parts). Comprehensive experiments across videos and audios verify the effectiveness of our model on representation disentanglement and generation of sequential data, and demonstrate that, our model with self-supervision performs comparable to, if not better than, the fully-supervised model with ground truth labels, and outperforms state-of-the-art unsupervised models by a large margin.

机译：我们提出了一种顺序变分自动编码器，以学习在自我监督下顺序数据（例如视频和音频）的解缠表示。具体而言，我们从输入数据本身或某些现成的功能模型中利用了一些易于访问的监管信号的优势，并因此为模型设计了辅助任务以利用这些信号。在信号的监督下，我们的模型可以轻松地将输入序列的表示分解为静态因子和动态因子（即时不变和时变部分）。跨视频和音频进行的全面实验验证了我们的模型在表征解开和顺序数据生成方面的有效性，并表明，具有自我监督功能的我们的模型的性能与具有地面真实性标签的完全监督的模型相当，甚至更好。并大大超越了最新的无监督模型。

著录项

来源
《IEEE/CVF Conference on Computer Vision and Pattern Recognition》|2020年|6537-6546|共10页
会议地点
作者
Yizhe Zhu; Martin Renqiang Min; Asim Kadav; Hans Peter Graf;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Videos; Data models; Task analysis; Visualization; Dynamics; Computational modeling; Three-dimensional displays;

机译：视频;数据模型;任务分析;可视化;动力学;计算建模;三维显示;

相似文献

外文文献
中文文献
专利

1. Y-Autoencoders: Disentangling latent representations via sequential encoding [J] . Patacchiola Massimiliano, Fox-Roberts Patrick, Rosten Edward Pattern recognition letters . 2020,第Deca期

机译：Y-AutoEncoders：通过顺序编码解开潜在的表示
2. Self-supervised learning for tool wear monitoring with a disentangled-variational-autoencoder [J] . Tim von Hahn, Chris K. Mechefske International Journal of Hydromechatronics . 2021,第1期

机译：具有解除态变差 - 自动化器的工具磨损监控自我监督学习
3. DCR: Disentangled component representation for sketch generation [J] . Cao Zhong, Cui Sen, Zhang Changshui Pattern recognition letters . 2021,第May期

机译：DCR：剪影生成的解除组件表示
4. Polarized-VAE: Proximity Based Disentangled Representation Learning for Text Generation [C] . Vikash Balasubramanian, Ivan Kobyzev, Hareesh Bahuleyan, Conference of the European Chapter of the Association for Computational Linguistics . 2021

机译：偏振 - VAE：基于邻近的分解代表学习文本生成
5. A Deeper Look at the Unsupervised Learning of Disentangled Representations in β-Vae from the Perspective of Core Object Recognition [D] . Sikka, Harshvardhan Digvijay. 2020

机译：从核心对象识别的角度来看，更深入地看看β-VAE中的解散表示的无情的学习
6. MichiGAN: sampling from disentangled representations of single-cell data using generative adversarial networks [O] . Hengshi Yu, Joshua D. Welch 2021

机译：密歇根州：使用生成的对抗网络从单细胞数据的脱屑表示抽样
7. S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation [O] . Yizhe Zhu, Martin Renqiang Min, Asim Kadav, 2020

机译：S3VAE：用于表示解剖和数据生成的自我监督顺序VAE

S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅