...
首页> 外文期刊>Acoustical science and technology >Generative adversarial networks: Foundations and applications
【24h】

Generative adversarial networks: Foundations and applications

机译:生成对抗网络:基础和应用

获取原文

摘要

In statistical signal processing and machine learning, an open issue has been how to obtain a generative model that can produce samples from high-dimensional data distributions such as images and speeches. Generative adversarial networks (GANs) have emerged as a powerful framework that provides clues to solving this problem. A GAN is composed of two networks: a generator that transforms noise variables to data space and a discriminator that discriminates real and generated data. These two networks are optimized using a min-max game: the generator attempts to deceive the discriminator by generating data indistinguishable from the real data, while the discriminator attempts not to be deceived by the generator by finding the best discrimination between real and generated data. This novel framework enables the implicit estimation of a data distribution and enables the generator to generate high-fidelity data that are almost indistinguishable from real data. This beneficial and powerful property has attracted a great deal of attention, and a wide range of research, from basic research to practical applications, has been recently conducted. In this paper, I summarize these studies and explain the foundations and applications of GANs. Specifically, I first clarify the relation between GANs and other deep generative models then provide the theory of GANs with numerical formula. Next, I introduce recent advances in GANs and describe the impressive applications that are highly related to acoustic and speech signal processing. Finally, I conclude this paper by mentioning future directions.
机译:在统计信号处理和机器学习中,一个未解决的问题是如何获得一个生成模型,该模型可以从高维数据分布(例如图像和语音)中生成样本。生成对抗网络(GAN)已经成为功能强大的框架,为解决此问题提供了线索。 GAN由两个网络组成:将噪声变量转换为数据空间的生成器和用于区分实际数据和生成数据的鉴别器。这两个网络使用最小-最大博弈进行了优化:生成器试图通过生成与真实数据无法区分的数据来欺骗鉴别器,而鉴别器试图通过找到真实数据与生成数据之间的最佳区分来不被生成器欺骗。这种新颖的框架能够隐式估计数据分布,并使生成器能够生成与真实数据几乎无法区分的高保真度数据。这种有益而强大的特性引起了广泛的关注,并且最近进行了从基础研究到实际应用的广泛研究。在本文中,我总结了这些研究并解释了GAN的基础和应用。具体而言,我首先阐明GAN与其他深度生成模型之间的关系,然后为GAN提供具有数值公式的理论。接下来,我将介绍GAN的最新进展,并描述与声学和语音信号处理高度相关的令人印象深刻的应用。最后,我通过提及未来的方向来总结本文。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号