Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition

Abstract

Interest in modeling long-tailed data has been growing. Unlike artificially collected datasets, long-tailed data exist naturally in the real world and are thus more realistic. To deal with the class imbalance problem, we introduce an Inflated Episodic Memory (IEM) for long-tailed visual recognition. First, our IEM augments convolutional neural networks with categorical representative features for rapid learning on tail classes. In traditional few-shot learning, a single prototype is usually used to represent a category. However, long-tailed data have higher intra-class variance, so it can be challenging to learn a single prototype for one category. Thus, we introduce IEM to store the most discriminative feature for each category individually. In addition, the memory banks are updated independently, which further decreases the chance of learning skewed classifiers. Second, we introduce a novel region self-attention mechanism for multi-scale spatial feature map encoding. Incorporating more discriminative features is beneficial for improving generalization on tail classes. We propose to encode local feature maps at multiple scales while aggregating spatial contextual information at the same time. Equipped with IEM and region self-attention, we achieve state-of-the-art performance on four standard long-tailed image recognition benchmarks. We also validate the effectiveness of IEM on a long-tailed video recognition benchmark, i.e., YouTube-8M.
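
The idea of per-category memory banks that are updated independently can be illustrated with a minimal sketch. It is not the authors' implementation: the class name `PerClassMemory`, the fixed number of slots per class, the score-based replacement rule, and the max-cosine readout are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import numpy as np


class PerClassMemory:
    """Hypothetical per-class memory: each class owns a small bank of feature slots."""

    def __init__(self, num_classes, slots, dim):
        # One bank of `slots` feature vectors per class.
        self.keys = np.zeros((num_classes, slots, dim))
        # Score attached to each slot; -inf marks an empty slot (assumed convention).
        self.scores = np.full((num_classes, slots), -np.inf)

    def write(self, label, feature, score):
        """Update only the bank of `label`; banks of other classes are untouched."""
        feature = feature / (np.linalg.norm(feature) + 1e-8)
        weakest = int(np.argmin(self.scores[label]))
        # Assumed rule: overwrite the weakest slot if the new feature scores higher.
        if score > self.scores[label, weakest]:
            self.keys[label, weakest] = feature
            self.scores[label, weakest] = score

    def read(self, feature):
        """Per-class similarity: best cosine match among that class's filled slots."""
        feature = feature / (np.linalg.norm(feature) + 1e-8)
        sims = self.keys @ feature  # shape: (num_classes, slots)
        sims = np.where(np.isfinite(self.scores), sims, -np.inf)  # ignore empty slots
        return sims.max(axis=1)


if __name__ == "__main__":
    memory = PerClassMemory(num_classes=3, slots=2, dim=2)
    # Class 0 has two distinct modes, which a single mean prototype would blur.
    memory.write(0, np.array([1.0, 0.0]), score=0.9)
    memory.write(0, np.array([0.0, 1.0]), score=0.8)
    memory.write(2, np.array([-1.0, 0.0]), score=0.7)
    print(memory.read(np.array([0.0, 1.0])))
```

With several slots per class, a query near either mode of class 0 still finds a close match, whereas the average of the two stored vectors would have a cosine similarity of only about 0.71 with the same query.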

Bibliographic record

Source: IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 4343-4352 (10 pages)
Authors: Linchao Zhu; Yi Yang
Original format: PDF
Keywords: Visualization; Prototypes; Training; Feature extraction; Robustness; Data models; Encoding
