IEEE Winter Conference on Applications of Computer Vision

ULSAM: Ultra-Lightweight Subspace Attention Module for Compact Convolutional Neural Networks

Abstract

The capability of the self-attention mechanism to model long-range dependencies has catapulted its deployment in vision models. Unlike convolution operators, self-attention offers an infinite receptive field and enables compute-efficient modeling of global dependencies. However, the existing state-of-the-art attention mechanisms incur high compute and/or parameter overheads, and are hence unfit for compact convolutional neural networks (CNNs). In this work, we propose a simple yet effective "Ultra-Lightweight Subspace Attention Mechanism" (ULSAM), which infers different attention maps for each feature map subspace. We argue that learning separate attention maps for each feature subspace enables multi-scale and multi-frequency feature representation, which is more desirable for fine-grained image classification. Our method of subspace attention is orthogonal and complementary to the existing state-of-the-art attention mechanisms used in vision models. ULSAM is end-to-end trainable and can be deployed as a plug-and-play module in pre-existing compact CNNs. Notably, our work is the first attempt to use a subspace attention mechanism to increase the efficiency of compact CNNs. To show the efficacy of ULSAM, we perform experiments with MobileNet-V1 and MobileNet-V2 as backbone architectures on ImageNet-1K and three fine-grained image classification datasets. We achieve ≈13% and ≈25% reduction in both the FLOPs and parameter counts of MobileNet-V2 with a 0.27% and more than 1% improvement in top-1 accuracy on the ImageNet-1K and fine-grained image classification datasets, respectively. Code and trained models are available at https://github.com/Nandan91/ULSAM.
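
To make the subspace-attention idea concrete, the following is a minimal PyTorch-style sketch of a per-subspace attention block: the feature maps are split into equal channel groups, each group produces its own single-channel spatial attention map via a small branch, and the re-weighted groups are concatenated back. The specific branch layout (depthwise 1x1 convolution, 3x3 max-pooling, pointwise convolution, spatial softmax, residual combination) and all hyperparameters here are illustrative assumptions, not the authors' reference implementation; consult the linked repository for the official code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SubspaceAttention(nn.Module):
    """Illustrative per-subspace attention block (assumed layout, not the official ULSAM code)."""

    def __init__(self, channels: int, num_subspaces: int):
        super().__init__()
        assert channels % num_subspaces == 0, "channels must split evenly into subspaces"
        self.num_subspaces = num_subspaces
        g = channels // num_subspaces  # feature maps per subspace
        # One tiny attention branch per subspace:
        # depthwise 1x1 conv -> 3x3 max-pool -> 1x1 conv down to a single attention map.
        self.branches = nn.ModuleList(
            nn.Sequential(
                nn.Conv2d(g, g, kernel_size=1, groups=g, bias=False),
                nn.MaxPool2d(kernel_size=3, stride=1, padding=1),
                nn.Conv2d(g, 1, kernel_size=1, bias=False),
            )
            for _ in range(num_subspaces)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Split the channel dimension into equal subspaces and attend to each one independently.
        chunks = torch.chunk(x, self.num_subspaces, dim=1)
        outs = []
        for chunk, branch in zip(chunks, self.branches):
            logits = branch(chunk)  # (N, 1, H, W)
            n, _, h, w = logits.shape
            # Softmax over all spatial positions of the subspace's attention map.
            attn = F.softmax(logits.view(n, 1, -1), dim=-1).view(n, 1, h, w)
            # Re-weight the subspace features and keep a residual path.
            outs.append(chunk * attn + chunk)
        return torch.cat(outs, dim=1)


if __name__ == "__main__":
    block = SubspaceAttention(channels=64, num_subspaces=4)
    y = block(torch.randn(2, 64, 28, 28))
    print(y.shape)  # torch.Size([2, 64, 28, 28])
```

Because each branch operates on only a fraction of the channels and outputs a single map, the parameter and FLOP overhead of such a block stays small, which is the property the abstract emphasizes for compact CNN backbones.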
