Towards Audio to Scene Image Synthesis Using Generative Adversarial Network


Abstract

Humans can imagine a scene from a sound. We want machines to do the same by using conditional generative adversarial networks (GANs). By applying techniques including spectral norm, a projection discriminator, and an auxiliary classifier, the model generates images of better quality than a naive conditional GAN in both subjective and objective evaluations. Almost three-fourths of people agree that our model has the ability to generate images related to sounds. When the same sound is input at different volumes, our model outputs changes of correspondingly different scales, showing that it has learned the relationship between sounds and images to some extent.
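The volume experiment mentioned in the abstract can be pictured with a minimal sketch. It assumes that "different volumes of the same sound" means scaling one waveform by a constant gain; the abstract does not say how the authors actually prepared their inputs. The test tone, the sample rate, the gain values, and the helper names `scale_volume` and `rms` are all hypothetical and chosen only for illustration.

```python
import math


def scale_volume(samples, gain):
    """Multiply every sample by `gain`, clipping to the range [-1.0, 1.0]."""
    return [max(-1.0, min(1.0, s * gain)) for s in samples]


def rms(samples):
    """Root-mean-square level of a waveform, a simple proxy for loudness."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


# Hypothetical stand-in for a recorded sound: a 440 Hz tone, 0.1 s at 16 kHz.
sample_rate = 16000
tone = [0.2 * math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(1600)]

# Assumed gain factors; each variant is the same sound at a different volume.
gains = [0.5, 1.0, 2.0, 4.0]
variants = {g: scale_volume(tone, g) for g in gains}
levels = {g: rms(v) for g, v in variants.items()}

for g in gains:
    print(f"gain {g}: RMS level {levels[g]:.4f}")
```

Each entry in `variants` would then be fed to the audio encoder of a trained generator, and the resulting images compared across gains. That model is not part of this sketch.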


Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2019 | pp. 496-500 | 5 pages
Authors
Chia-Hung Wan; Shun-Po Chuang; Hung-Yi Lee
Original format
PDF
Keywords
audio signal processing; image processing; neural nets; spectral analysis

Similar literature

1. A Scene Images Diversity Improvement Generative Adversarial Network for Remote Sensing Image Scene Classification [J]. Pan Xin, Zhao Jian, Xu Jun. IEEE Geoscience and Remote Sensing Letters. 2020, issue 10
2. Background and foreground disentangled generative adversarial network for scene image synthesis [J]. Ni Jiancheng, Zhang Susu, Zhou Zili. Computers & Graphics. 2021, issue Juna
3. Multiview Scene Image Inpainting Based on Conditional Generative Adversarial Networks [J]. Zefeng Yuan, Hengyu Li, Jingyi Liu. IEEE Transactions on Intelligent Vehicles. 2020, issue 2
4. Towards Audio to Scene Image Synthesis Using Generative Adversarial Network [C]. Chia-Hung Wan, Shun-Po Chuang, Hung-Yi Lee. IEEE International Conference on Acoustics, Speech and Signal Processing. 2019
5. Generative Adversarial Networks for Image Synthesis [D]. Zhang, Han. 2019
6. Image synthesis of monoenergetic CT image in dual‐energy CT using kilovoltage CT with deep convolutional generative adversarial networks [O]. Daisuke Kawahara, Shuichi Ozawa, Tomoki Kimura. 2021
