首页> 外文会议>IEEE International Conference on Image Processing >Interpretable Representation Learning on Natural Image Datasets via Reconstruction in Visual-Semantic Embedding Space
【24h】

Interpretable Representation Learning on Natural Image Datasets via Reconstruction in Visual-Semantic Embedding Space

机译:通过视觉语义嵌入空间重建的自然图像数据集的可解释表示学习

获取原文

摘要

Unsupervised learning of disentangled representations is a core task for discovering interpretable factors of variation in an image dataset. We propose a novel method that can learn disentangled representations with semantic explanations on natural image datasets. In our method, we guide the representation learning of a variational autoencoder (VAE) via reconstruction in a visual-semantic embedding (VSE) space to leverage the semantic information of image data and explain the learned latent representations in an unsupervised manner. We introduce a semantic sub-encoder and a linear semantic sub-decoder to learn word vectors corresponding to the latent variables to explain factors of variation in the language form. Each basis vector (column) of the linear semantic sub-decoder corresponds to each latent variable, and we can interpret the basis vectors as word vectors indicating the meanings of the latent representations. By introducing the sub-encoder and the sub-decoder, our model can learn latent representations that are not just disentangled but interpretable. Comparing with other state-of-the-art unsupervised disentangled representation learning methods, we observe significant improvements in the disentanglement and the transferability of latent representations.
机译:无拘无措的解散表示的学习是用于发现图像数据集的可解释因素的核心任务。我们提出了一种新颖的方法,可以在自然图像数据集上学习具有语义解释的解除义的表示。在我们的方法中,我们通过在视觉语义嵌入(VSE)空间中的重建来指导变形AutiCoder(VAE)的表示学习,以利用图像数据的语义信息并以无监督的方式解释学习的潜在表示。我们介绍了一个语义子编码器和线性语义子解码器,以学习与潜在变量对应的字向量,以解释语言形式的变化因素。线性语义子解码器的每个基点(列)对应于每个潜在变量,并且我们可以将基向量解释为指示潜在表示的含义的字矢量。通过引入子编码器和子解码器,我们的模型可以学习不仅要解除不诚格但可解释的潜在表示。与其他最先进的无监督的解散代表学习方法相比,我们观察了解开的重大改善和潜在代表的可转移性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号