...
首页> 外文期刊>Journal of visual communication & image representation >Matrix-variate variational auto-encoder with applications to image process
【24h】

Matrix-variate variational auto-encoder with applications to image process

机译:具有应用于图像过程的矩阵变变自动编码器

获取原文
获取原文并翻译 | 示例

摘要

Variational Auto-Encoder (VAE) is an important probabilistic technology to model 1D vectorial data. However, when applying VAE model to 2D image, vectorization is necessary. Vectorization process may lead to dimension curse and lose valuable spatial information. To avoid these problems, we propose a novel VAE model based on matrix variables named as Matrix-variate Variational Auto-Encoder (MVVAE). In this model, input, hidden and latent variables are all in matrix form, therefore inherent spatial structure of 2D images can be maintained and utilized better. Especially, the latent variable is assumed to follow matrix Gaussian distribution which is more suitable for describing 2D images. To solve the weights and the posterior of latent variable, the variational inference process is given. The experiments are designed for three real-world application: reconstruction, denoising and completion. The experimental results demonstrate that MVVAE shows better performance than VAE and other probabilistic methods for modeling and processing 2D data. (C) 2020 Elsevier Inc. All rights reserved.
机译:变形式自动编码器(VAE)是模拟1D矢量数据的重要概率技术。但是,在将VAE模型应用于2D图像时,矢量化是必要的。矢量化过程可能导致尺寸诅咒并失去有价值的空间信息。为避免这些问题,我们提出了一种基于矩阵变量的新型VAE模型,名为矩阵变变自动编码器(MVVAE)。在该模型中,输入,隐藏和潜变量全部以矩阵形式,因此可以更好地维护和使用2D图像的固有空间结构。特别地,假设潜变量遵循更适合于描述2D图像的矩阵高斯分布。为了解决潜在变量的权重和后部,给出了变分推理过程。实验专为三个现实世界应用:重建,去噪和完成。实验结果表明,MVVAE表现出比VAE和其他用于建模和处理2D数据的概率方法更好的性能。 (c)2020 Elsevier Inc.保留所有权利。

著录项

  • 来源
    《Journal of visual communication & image representation 》 |2020年第2期| 102750.1-102750.9| 共9页
  • 作者单位

    Beijing Univ Technol Fac Informat Technol Beijing Key Lab Multimedia & Intelligent Software Beijing Peoples R China;

    Beijing Univ Technol Fac Informat Technol Beijing Key Lab Multimedia & Intelligent Software Beijing Peoples R China;

    Univ Sydney Univ Sydney Business School Discipline Business Analyt Sydney NSW 2006 Australia;

    Beijing Univ Technol Fac Informat Technol Beijing Key Lab Multimedia & Intelligent Software Beijing Peoples R China;

    Beijing Univ Technol Fac Informat Technol Beijing Key Lab Multimedia & Intelligent Software Beijing Peoples R China;

    Beijing Univ Technol Fac Informat Technol Beijing Key Lab Multimedia & Intelligent Software Beijing Peoples R China;

    Beijing Univ Technol Fac Informat Technol Beijing Key Lab Multimedia & Intelligent Software Beijing Peoples R China|Dalian Univ Technol Fac Elect Informat & Elect Engn Dalian Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Variational autoencoder; Matrix Gaussian distribution; Variational inference; Face completion; Image denoising;

    机译:变形式自动化器;矩阵高斯分布;变分推理;面部完成;图像去噪;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号