首页> 外文会议>International Conference on Imaging, Signal Processing and Communication >Using Margin-based Region of Interest Technique with Multi-Task Convolutional Neural Network and Template Matching for Robust Face Detection and Tracking System
【24h】

Using Margin-based Region of Interest Technique with Multi-Task Convolutional Neural Network and Template Matching for Robust Face Detection and Tracking System

机译:基于边缘的兴趣区域技术与多任务卷积神经网络和模板匹配的结合,用于鲁棒的人脸检测和跟踪系统

获取原文

摘要

Real-time face detection and tracking systems suffer from low accuracy and slow processing speed that lead to poor robustness. This problem is vital in real-time setups including human robot interactions (HRI) and video analysis systems. This paper presents margin-based region of interest (MROI) approach to speed up the processing time. Further a hybrid approach is also presented that combines Multi-task Convolutional Neural Networks (MTCNN) with template matching to improve face detection accuracy. The MROI approach which is responsible to speed up the processing time is presented in two variants with fixed and dynamic margin concepts. Dataset used in this work comprises of twenty RGB video files. Each video file is fifteen seconds long and been created from real-life videos containing a person in lecture delivering environment. Each video file contains a person in which the person moves, turns head and the camera also moves. The highest face detection and tracking accuracy achieved in this paper is 99.31% with a processing time of 14.93 milliseconds per frame. The proposed hybrid algorithm significantly improves the ability to detect and track faces in sideway orientation apart from frontal face. The proposed algorithm has the ability to process above 65 frames per second (FPS). The system presented has increased FPS processing ability by more than 400% as well as given boost to the accuracy if compared to the conventional MTCNN full frame scanning techniques.
机译:实时面部检测和跟踪系统的准确性较低且处理速度较慢,导致鲁棒性较差。这个问题在包括人机交互(HRI)和视频分析系统在内的实时设置中至关重要。本文提出了基于边际的关注区域(MROI)方法,以加快处理时间。此外,还提出了一种混合方法,该方法将多任务卷积神经网络(MTCNN)与模板匹配相结合,以提高人脸检测的准确性。负责加快处理时间的MROI方法在固定和动态边距概念的两个变体中提出。在这项工作中使用的数据集包括二十个RGB视频文件。每个视频文件的长度为15秒,是根据包含在演讲环境中的人员的真实视频创建的。每个视频文件都包含一个人,该人在其中移动,转头,并且摄像机也移动。本文实现的最高人脸检测和跟踪精度为99.31%,每帧的处理时间为14.93毫秒。所提出的混合算法大大提高了检测和跟踪除正面以外的横向方向上的面部的能力。所提出的算法具有每秒处理65帧以上(FPS)的能力。与传统的MTCNN全帧扫描技术相比,该系统将FPS处理能力提高了400%以上,并提高了准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号