Multi-Scale Attention Deep Neural Network for Fast Accurate Object Detection

Song Kaiyou; Yang Hua; Yin Zhouping

首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >Multi-Scale Attention Deep Neural Network for Fast Accurate Object Detection

【24h】

Multi-Scale Attention Deep Neural Network for Fast Accurate Object Detection

机译：多尺度注意力深度神经网络用于快速准确的目标检测

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Object detection remains a challenging task in computer vision due to the tremendous extent of changes in the appearances of objects caused by clustered backgrounds, occlusion, truncation, and scale change. Current deep neural network (DNN)-based object detection methods cannot simultaneously achieve a high accuracy and a high efficiency. To overcome this limitation, in this paper, we propose a novel multi-scale attention (MSA) DNN for accurate object detection with high efficiency. The proposed MSA-DNN method utilizes a novel multi-scale feature fusion module (MSFFM) to construct high-level semantic features. Subsequently, a novel MSA module (MSAM) based on the fused layers of the MSFFM is introduced to exploit the global semantic information of image-level labels to guide detection. On the one hand, MSAM can capture global semantic information to further enhance the semantic feature representation of the fused layers constructed by the MSFFM, thereby improving the detection accuracy. On the other hand, the MSA maps generated by MSAM can be employed to rapidly and coarsely locate objects at different scales. In addition, an attention-based hard negative mining strategy is introduced to filter out negative samples to reduce the search space, dramatically alleviating the severe class imbalance problem. Extensive experimental results on the challenging PASCAL VOC 2007, PASCAL VOC 2012, and MS COCO datasets demonstrate that MSA-DNN achieves a state-of-the-art detection accuracy while maintaining a high efficiency. Furthermore, MSA-DNN significantly improves the small-object detection accuracy.

机译：由于群集背景，遮挡，截断和缩放变化导致的对象外观变化很大，因此对象检测在计算机视觉中仍然是一项具有挑战性的任务。当前基于深度神经网络（DNN）的对象检测方法无法同时实现高精度和高效率。为了克服这一限制，在本文中，我们提出了一种新颖的多尺度注意力（MSA）DNN，可以高效地进行精确的目标检测。提出的MSA-DNN方法利用新颖的多尺度特征融合模块（MSFFM）来构建高级语义特征。随后，引入了一种基于MSFFM融合层的新颖MSA模块（MSAM），以利用图像级标签的全局语义信息来指导检测。一方面，MSAM可以捕获全局语义信息，以进一步增强MSFFM构造的融合层的语义特征表示，从而提高检测精度。另一方面，由MSAM生成的MSA映射可用于快速粗略地定位不同比例的对象。另外，引入了一种基于注意的硬否定挖掘策略，以过滤出否定样本以减少搜索空间，从而大大缓解了严重的类不平衡问题。在具有挑战性的PASCAL VOC 2007，PASCAL VOC 2012和MS COCO数据集上进行的大量实验结果表明，MSA-DNN在保持高效率的同时达到了最先进的检测精度。此外，MSA-DNN大大提高了小物体检测的准确性。

著录项

来源
《IEEE Transactions on Circuits and Systems for Video Technology 》 |2019年第10期| 2972-2985| 共14页
作者
Song Kaiyou; Yang Hua; Yin Zhouping;
展开▼
作者单位

Huazhong Univ Sci & Technol State Key Lab Digital Mfg Equipment & Technol Sch Mech Sci & Engn Wuhan 430074 Hubei Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Object detection; attention model; feature fusion; deep neural network;

机译：对象检测;注意模型特征融合深层神经网络;

相似文献

外文文献
中文文献
专利

1. Automatic and Accurate Epilepsy Ripple and Fast Ripple Detection via Virtual Sample Generation and Attention Neural Networks [J] . Guo Jiayang, Li Hailong, Pan Yijie, IEEE transactions on neural systems and rehabilitation engineering . 2020 ,第8期

机译：通过虚拟样本生成和注意神经网络自动和准确的癫痫纹波和快速纹波检测
2. Multi-scale deep neural network for salient object detection [J] . Fen Xiao, Wenzheng Deng, Liangchan Peng, Image Processing, IET . 2018 ,第11期

机译：多尺度深度神经网络用于显着目标检测
3. Towards Fast and Accurate Object Detection in Bio-Inspired Spiking Neural Networks Through Bayesian Optimization [J] . Seijoon Kim, Seongsik Park, Byunggook Na, Quality Control, Transactions . 2021 ,第1期

机译：通过贝叶斯优化的生物启发尖峰神经网络快速准确地检测
4. A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection [C] . Zhaowei Cai, Quanfu Fan, Rogerio S. Feris, European conference on computer vision . 2016

机译：统一的多尺度深度卷积神经网络用于快速目标检测
5. Deep neural networks and regression models for object detection and pose estimation [D] . Hara, Kota. 2016

机译：用于对象检测和姿态估计的深度神经网络和回归模型
6. Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images [O] . Feng Yang, Wentong Li, Haiwei Hu, 2020

机译：基于多尺度特征集成基于注意力的旋转网络在VHR航空图像中的目标检测
7. MDCN: Multi-Scale, Deep Inception Convolutional Neural Networks for Efficient Object Detection [O] . Wenchi Ma, Yuanwei Wu, Zongbo Wang, 2018

机译：MDCN：多尺度，深度卷积卷积神经网络，用于高效对象检测

Multi-Scale Attention Deep Neural Network for Fast Accurate Object Detection

摘要

著录项

相似文献

相关主题

期刊订阅