首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Towards Audio to Scene Image Synthesis Using Generative Adversarial Network

【24h】

Towards Audio to Scene Image Synthesis Using Generative Adversarial Network

机译：利用生成对抗网络向音频到场景图像合成

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Humans can imagine a scene from a sound. We want machines to do so by using conditional generative adversarial networks (GANs). By applying the techniques including spectral norm, projection discriminator and auxiliary classifier, compared with naive conditional GAN, the model can generate images with better quality in terms of both subjective and objective evaluations. Almost three-fourth of people agree that our model have the ability to generate images related to sounds. By inputting different volumes of the same sound, our model output different scales of changes based on the volumes, showing that our model truly knows the relationship between sounds and images to some extent.

机译：人类可以从声音中想象一个场景。我们希望通过使用条件生成的对冲网络（GAN）来这样做。通过应用包括光谱规范，投影鉴别器和辅助分类器的技术，与天真条件GaN相比，该模型可以在主观和客观评估方面产生具有更好质量的图像。几乎四分之三的人同意我们的模型能够生成与声音相关的图像。通过输入相同声音的不同卷，我们的模型基于卷输出不同的更改尺度，显示我们的模型真正了解声音和图像之间的关系。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing 》|2019年|665p|共5页
会议地点
作者
Chia-Hung Wan; Shun-Po Chuang; Hung-Yi Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
conditional GANs; audio-visual; cross-modal generation;

机译：条件GANS;视听;跨模态生成;

相似文献

外文文献
中文文献
专利

1. A Scene Images Diversity Improvement Generative Adversarial Network for Remote Sensing Image Scene Classification [J] . Pan Xin, Zhao Jian, Xu Jun IEEE Geoscience and Remote Sensing Letters . 2020 ,第10期

机译：用于遥感图像场景分类的场景图像分集改善生成的对抗网络
2. Background and foreground disentangled generative adversarial network for scene image synthesis [J] . Ni Jiancheng, Zhang Susu, Zhou Zili, Computers & Graphics . 2021 ,第Juna期

机译：背景和前景分解生成对抗场景图像合成
3. Multiview Scene Image Inpainting Based on Conditional Generative Adversarial Networks [J] . Zefeng Yuan, Hengyu Li, Jingyi Liu, IEEE Transactions on Intelligent Vehicles . 2020 ,第2期

机译：基于条件生成对抗网络的多视图场景图像修复
4. Towards Audio to Scene Image Synthesis Using Generative Adversarial Network [C] . Chia-Hung Wan, Shun-Po Chuang, Hung-Yi Lee IEEE International Conference on Acoustics, Speech and Signal Processing . 2019

机译：使用生成对抗网络实现音频到场景图像合成
5. Generative Adversarial Networks for Image Synthesis [D] . Zhang, Han. 2019

机译：图像合成的对抗网络
6. Image synthesis of monoenergetic CT image in dual‐energy CT using kilovoltage CT with deep convolutional generative adversarial networks [O] . Daisuke Kawahara, Shuichi Ozawa, Tomoki Kimura, 2021

机译：用深卷积生成对抗网络使用千伏CT的双能CT中单能仪CT图像的图像合成
7. Image synthesis of monoenergetic CT image in dual‐energy CT using kilovoltage CT with deep convolutional generative adversarial networks [O] . Daisuke Kawahara, Shuichi Ozawa, Tomoki Kimura, 2021

机译：用深卷积生成对抗网络使用千伏CT的双能CT中单能仪CT图像的图像合成

Towards Audio to Scene Image Synthesis Using Generative Adversarial Network

摘要

著录项

相似文献

相关主题

期刊订阅