International Conference on Computer Communication and the Internet

ViT-GAN: Using Vision Transformer as Discriminator with Adaptive Data Augmentation

Abstract

Attention mechanisms are now regarded as an effective tool for image recognition. The Vision Transformer (ViT) applies a Transformer to images and achieves very high recognition performance with fewer parameters than Big Transfer (BiT) and Noisy Student. We therefore consider Self-Attention-based networks to be slimmer than convolution-based networks. We use a ViT as the discriminator in a Generative Adversarial Network (GAN) to obtain the same performance with a smaller model, which we name ViT-GAN. We also find that parameter sharing is very useful for making the ViT parameter-efficient. However, ViT's performance depends heavily on the number of data samples, so we propose a new data augmentation method in which the augmentation strength varies adaptively, helping the ViT converge faster and perform better. With our data augmentation, we show that the ViT-based discriminator achieves almost the same FID with 35% fewer parameters than the original discriminator.
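The abstract does not spell out how the augmentation strength is adapted. A common approach to adaptive discriminator augmentation (as in StyleGAN2-ADA) is to raise the augmentation probability when the discriminator starts overfitting, i.e. when it scores most real samples positive, and lower it otherwise. The sketch below illustrates that feedback loop only; the class name, target value, and step size are illustrative assumptions, not the paper's exact algorithm.

```python
class AdaptiveAugmentation:
    """Illustrative ADA-style controller (assumed, not the paper's exact
    method): the augmentation probability p rises while the discriminator
    overfits and falls otherwise."""

    def __init__(self, target=0.6, step=0.01, p=0.0):
        self.target = target  # desired fraction of positive real logits
        self.step = step      # adjustment applied per update
        self.p = p            # current augmentation probability in [0, 1]

    def update(self, real_logits):
        # r_t: fraction of real samples the discriminator scores positive;
        # values near 1.0 signal overfitting to the training set.
        r_t = sum(1 for x in real_logits if x > 0) / len(real_logits)
        if r_t > self.target:
            self.p = min(1.0, self.p + self.step)  # augment more aggressively
        else:
            self.p = max(0.0, self.p - self.step)  # relax augmentation
        return self.p
```

During GAN training, `update` would be called every few discriminator steps, and each real or fake image would then be augmented with probability `p` before being passed to the ViT discriminator.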
