...
首页> 外文期刊>Quality Control, Transactions >Real-Time Detection and Motion Recognition of Human Moving Objects Based on Deep Learning and Multi-Scale Feature Fusion in Video
【24h】

Real-Time Detection and Motion Recognition of Human Moving Objects Based on Deep Learning and Multi-Scale Feature Fusion in Video

机译:基于深度学习和多尺度特征融合的人类移动物体的实时检测与运动识别视频

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

At present, human body moving target detection and recognition algorithms based on deep learning have made breakthrough progress. However, in some applications with high real-time requirements, the existing deep learning real-time detection and recognition network is difficult to achieve high detection accuracy. Therefore, how to achieve accurate positioning and recognition of human moving targets while ensuring real-time detection is still an urgent problem in this field. Based on the single shot multi-box detector (SSD) real-time detection network, this paper proposes a real-time detection positioning and recognition network based on multi-scale feature fusion (IMFF-SSD), which improves the positioning accuracy and identification accuracy. First, this article analyzes the multi-scale features extracted from the SSD network. It combines the position-sensitive information provided by low-level detail features with the context information provided by high-level semantic features through feature fusion, which effectively improves positioning accuracy of the target prediction layer in the SSD network. Secondly, a feature embedded prediction structure is designed to strengthen the semantics of target features without changing the spatial resolution of the SSD prediction layer, and embed low-scale detailed features in high-semantic features for collaborative prediction of targets. This improves the accuracy of the SSD network & x2019;s recognition of human moving targets at all scales. The experimental results show that by combining the above two improvements, the real-time monitoring and recognition network based on multi-scale feature fusion proposed in this paper has achieved a greater degree of improvement in positioning accuracy and motion recognition accuracy than the original SSD, which is better than some current the human body moving object detection and recognition algorithm has great advantages.
机译:目前,基于深度学习的人体移动目标检测和识别算法取得了突破性。然而,在具有高实时要求的应用中,现有的深度学习实时检测和识别网络难以实现高检测精度。因此,如何实现人类移动目标的准确定位和识别,同时确保实时检测仍然是该领域的紧急问题。基于单次拍摄多箱检测器(SSD)实时检测网络,本文提出了一种基于多尺度特征融合(IMFF-SSD)的实时检测定位和识别网络,从而提高了定位精度和识别准确性。首先,本文分析了从SSD网络中提取的多尺度功能。它结合了低级细节特征提供的位置敏感信息,其中通过通过特征融合提供了高电平语义特征提供的上下文信息,从而有效提高了SSD网络中的目标预测层的定位精度。其次,设计特征嵌入式预测结构以增强目标特征的语义,而不改变SSD预测层的空间分辨率,并且在高语义特征中嵌入低规模的详细特征以进行目标的协作预测。这提高了SSD网络的准确性和X2019; S在所有尺度上识别人类移动目标的识别。实验结果表明,通过组合上述两种改进,本文提出的基于多尺度特征融合的实时监测和识别网络已经实现了比原始SSD的定位精度和运动识别精度的更大改善程度。这比一些电流更好的人体移动物体检测和识别算法具有很大的优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号