Journal: Science China (《中国科学》)

Triple discriminator generative adversarial network for zero-shot image classification


Abstract

One key challenge in zero-shot classification (ZSC) is exploiting the knowledge hidden in unseen classes. Generative methods such as generative adversarial networks (GANs) are typically employed to synthesize the visual information of unseen classes. However, most of these methods exploit global semantic features while neglecting the discriminative differences of local semantic features when synthesizing images, which may lead to sub-optimal results. In fact, local semantic information can provide more discriminative knowledge than global information. To this end, this paper presents a new triple-discriminator GAN for ZSC, called TDGAN, which incorporates a text-reconstruction network into a dual-discriminator GAN (D2GAN) to realize cross-modal mapping from text descriptions to their visual representations. The text-reconstruction network focuses on key text descriptions and aligns semantic relationships so that synthetic visual features effectively represent images. Sharma-Mittal entropy is used in the loss function to keep the distribution of synthetic classes as close as possible to that of the real classes. Extensive experiments on the Caltech-UCSD Birds-2011 and North America Birds datasets demonstrate that TDGAN consistently yields competitive performance compared with several state-of-the-art ZSC methods.
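The loss term mentioned above builds on Sharma-Mittal entropy, a two-parameter family that recovers Shannon entropy (q, r → 1), Rényi entropy (r → 1), and Tsallis entropy (r = q) as limiting cases. The abstract does not give the paper's exact parameterization, so the following is only a minimal sketch of the quantity for a discrete distribution; the function name and the default values of q and r are illustrative assumptions.

```python
import numpy as np

def sharma_mittal_entropy(p, q=2.0, r=0.5):
    """Sharma-Mittal entropy H_{q,r}(p) = [(sum_i p_i^q)^((1-r)/(1-q)) - 1] / (1 - r).

    Assumes q != 1 and r != 1; Shannon, Renyi, and Tsallis entropies
    arise as limits of these parameters.
    """
    p = np.asarray(p, dtype=float)
    p = p / p.sum()                # normalize to a probability distribution
    s = np.sum(p ** q)             # generalized moment sum_i p_i^q
    return (s ** ((1.0 - r) / (1.0 - q)) - 1.0) / (1.0 - r)

# A degenerate (one-hot) distribution has zero entropy for any valid (q, r):
print(sharma_mittal_entropy([1.0, 0.0, 0.0]))       # -> 0.0
# A uniform distribution over n outcomes gives (n**(1 - r) - 1) / (1 - r):
print(sharma_mittal_entropy([0.25] * 4))            # n=4, r=0.5 -> 2.0
```

In a GAN loss such as TDGAN's, a divergence derived from this entropy family would be minimized so that the entropy profile of the synthetic class distribution matches that of the real classes; the exact coupling to the discriminator outputs is defined in the paper itself.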
