首页> 外文期刊>Signal Processing. Image Communication: A Publication of the the European Association for Signal Processing >Feature refinement for image-based driver action recognition via multi-scale attention convolutional neural network
【24h】

Feature refinement for image-based driver action recognition via multi-scale attention convolutional neural network

机译:通过多尺度关注卷积神经网络的基于图像的驱动器动作识别特征精制

获取原文
获取原文并翻译 | 示例
           

摘要

Driver distraction has currently been a global issue causing the dramatic increase of road accidents and casualties. However, recognizing distracted driving action remains a challenging task in the field of computer vision, since inter-class variations between different driver action categories are quite subtle. To overcome this difficulty, in this paper, a novel deep learning based approach is proposed to extract fine-grained feature representation for image-based driver action recognition. Specifically, we improve the existing convolutional neural network from two aspects: (1) we employ multi-scale convolutional block with different receptive fields of kernel sizes to generate hierarchical feature map and adopt maximum selection unit to adaptively combine multi-scale information; (2) we incorporate an attention mechanism to learn pixel saliency and channel saliency between convolutional features so that it can guide the network to intensify local detail information and suppress global background information. For experiment, we evaluate the designed architecture on multiple driver action datasets. The quantitative experiment result shows that the proposed multi-scale attention convolutional neural network (MSA-CNN) obtains the state of the art performance in image-based driver action recognition.
机译:司机分心目前是一个全球问题,导致道路事故和伤亡的戏剧性增加。然而,识别分散注意力的驾驶动作仍然是计算机视野领域的具有挑战性的任务,因为不同驾驶员动作类别之间的阶级变化非常微妙。为了克服这种困难,本文提出了一种新的基于深度学习的方法来提取基于图像的驾驶员动作识别的细粒度特征表示。具体地,我们从两个方面改进现有的卷积神经网络:(1)我们采用多尺度卷积块,具有不同的内核大小的不同接收领域,以产生分层特征图,并采用最大选择单元以自适应地组合多尺度信息; (2)我们纳入注意机制,以学习卷积功能之间的像素显着性和信道显着性,以便它可以指导网络加强本地详细信息并抑制全球背景信息。对于实验,我们在多个驱动程序行动数据集上评估设计的架构。定量实验结果表明,所提出的多尺度关注卷积神经网络(MSA-CNN)获得基于图像的驱动器动作识别中的最先进性能的状态。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号