Journal: Pattern Recognition: The Journal of the Pattern Recognition Society
Guided CNN for generalized zero-shot and open-set recognition using visual and semantic prototypes

Abstract

In the process of exploring the world, curiosity constantly drives humans to cognize new things. Suppose you are a zoologist: when presented with an animal image, you can recognize it immediately if you know its class; otherwise, you would more likely attempt to cognize it by exploiting the side-information (e.g., semantic information) you have accumulated. Inspired by this, this paper decomposes the generalized zero-shot learning (G-ZSL) task into an open set recognition (OSR) task and a zero-shot learning (ZSL) task, where OSR recognizes seen classes (those we have seen or known) and rejects unseen classes (those we have never seen or known before), while ZSL identifies the unseen classes rejected by the former. Simultaneously, without violating OSR's assumption that only known-class knowledge is available during training, we also make a first attempt to explore a new generalized open set recognition (G-OSR) task by introducing the side-information accumulated from known classes into OSR. For G-ZSL, such a decomposition effectively solves the class-overfitting problem, ubiquitous in most existing G-ZSL methods, in which unseen classes are easily misclassified as seen classes. For G-OSR, on the other hand, introducing such semantic information of known classes not only improves recognition performance but also endows OSR with the ability to cognize unknown classes. Specifically, a visual and semantic prototypes-jointly guided convolutional neural network (VSG-CNN) is proposed to fulfill these two tasks (G-ZSL and G-OSR) in a unified end-to-end learning framework. Extensive experiments on benchmark datasets demonstrate the advantages of our learning framework. (C) 2020 Elsevier Ltd. All rights reserved.
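The two-stage decomposition the abstract describes — an OSR stage that accepts samples close to a seen-class prototype and rejects the rest, followed by a ZSL stage that matches rejected samples against unseen-class semantic prototypes — can be sketched as follows. This is a toy illustration with made-up prototypes, a random linear visual-to-semantic projection, and an assumed distance threshold `tau`; it is not the paper's actual VSG-CNN.

```python
import math
import random

# Toy, hypothetical sketch of the OSR-then-ZSL decomposition described in
# the abstract (not the paper's VSG-CNN implementation).

random.seed(0)

def randvec(n):
    return [random.gauss(0, 1) for _ in range(n)]

def dist(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Visual prototypes for 3 seen classes; semantic (attribute) prototypes
# for 2 unseen classes; a toy 8x5 linear visual->semantic projection.
seen_protos = [randvec(8) for _ in range(3)]
unseen_sem_protos = [randvec(5) for _ in range(2)]
W = [randvec(5) for _ in range(8)]

def project(x):
    # Map a visual feature to the semantic (attribute) space.
    return [sum(x[i] * W[i][j] for i in range(8)) for j in range(5)]

def classify(x, tau=3.0):
    """Stage 1 (OSR): accept x as a seen class if it lies within tau of
    some visual prototype; otherwise reject it as unknown.
    Stage 2 (ZSL): project rejected samples into the semantic space and
    match them against unseen-class semantic prototypes."""
    d = [dist(x, p) for p in seen_protos]
    if min(d) < tau:
        return ("seen", d.index(min(d)))
    s = project(x)
    d_u = [dist(s, p) for p in unseen_sem_protos]
    return ("unseen", d_u.index(min(d_u)))

# A sample close to a seen prototype is accepted by the OSR stage; a
# distant sample is rejected and routed to the ZSL stage.
near = [v + 0.1 for v in seen_protos[0]]
far = [v + 10.0 for v in seen_protos[0]]
print(classify(near)[0])  # "seen"
print(classify(far)[0])   # "unseen"
```

The threshold `tau` plays the role of OSR's accept/reject decision; in the paper this boundary is learned end-to-end rather than fixed by hand.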
