...
首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >Multi-Scale Attention Deep Neural Network for Fast Accurate Object Detection
【24h】

Multi-Scale Attention Deep Neural Network for Fast Accurate Object Detection

机译:多尺度注意力深度神经网络用于快速准确的目标检测

获取原文
获取原文并翻译 | 示例

摘要

Object detection remains a challenging task in computer vision due to the tremendous extent of changes in the appearances of objects caused by clustered backgrounds, occlusion, truncation, and scale change. Current deep neural network (DNN)-based object detection methods cannot simultaneously achieve a high accuracy and a high efficiency. To overcome this limitation, in this paper, we propose a novel multi-scale attention (MSA) DNN for accurate object detection with high efficiency. The proposed MSA-DNN method utilizes a novel multi-scale feature fusion module (MSFFM) to construct high-level semantic features. Subsequently, a novel MSA module (MSAM) based on the fused layers of the MSFFM is introduced to exploit the global semantic information of image-level labels to guide detection. On the one hand, MSAM can capture global semantic information to further enhance the semantic feature representation of the fused layers constructed by the MSFFM, thereby improving the detection accuracy. On the other hand, the MSA maps generated by MSAM can be employed to rapidly and coarsely locate objects at different scales. In addition, an attention-based hard negative mining strategy is introduced to filter out negative samples to reduce the search space, dramatically alleviating the severe class imbalance problem. Extensive experimental results on the challenging PASCAL VOC 2007, PASCAL VOC 2012, and MS COCO datasets demonstrate that MSA-DNN achieves a state-of-the-art detection accuracy while maintaining a high efficiency. Furthermore, MSA-DNN significantly improves the small-object detection accuracy.
机译:由于群集背景,遮挡,截断和缩放变化导致的对象外观变化很大,因此对象检测在计算机视觉中仍然是一项具有挑战性的任务。当前基于深度神经网络(DNN)的对象检测方法无法同时实现高精度和高效率。为了克服这一限制,在本文中,我们提出了一种新颖的多尺度注意力(MSA)DNN,可以高效地进行精确的目标检测。提出的MSA-DNN方法利用新颖的多尺度特征融合模块(MSFFM)来构建高级语义特征。随后,引入了一种基于MSFFM融合层的新颖MSA模块(MSAM),以利用图像级标签的全局语义信息来指导检测。一方面,MSAM可以捕获全局语义信息,以进一步增强MSFFM构造的融合层的语义特征表示,从而提高检测精度。另一方面,由MSAM生成的MSA映射可用于快速粗略地定位不同比例的对象。另外,引入了一种基于注意的硬否定挖掘策略,以过滤出否定样本以减少搜索空间,从而大大缓解了严重的类不平衡问题。在具有挑战性的PASCAL VOC 2007,PASCAL VOC 2012和MS COCO数据集上进行的大量实验结果表明,MSA-DNN在保持高效率的同时达到了最先进的检测精度。此外,MSA-DNN大大提高了小物体检测的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号