首页> 外文会议>International Conference on Document Analysis and Recognition >Fast Method of ID Documents Location and Type Identification for Mobile and Server Application
【24h】

Fast Method of ID Documents Location and Type Identification for Mobile and Server Application

机译:用于移动和服务器应用程序的ID文档定位和类型识别的快速方法

获取原文

摘要

In this paper we discuss the problem of simultaneous document type recognition and projective distortion parameters estimation for the images of ID documents. There are two considered cases. In the first case a video stream captured using mobile devices is processed on the device. The second case considers photos or scanned images which are processed on a server. For each case the requirements are defined for the input data and processing speed. The universal approach is proposed, which allows solving the problem in both cases. The approach is based on representing the image as a constellation of feature points and descriptors, but in order to perform more accurate distortion parameters estimation straight lines and quadrangles are extracted from the input image and used as additional features. Techniques are described which allow to combine matched feature points, lines, and quadrangles to geometric verification using RANSAC. Best alternative selection criteria are proposed along with methods of solution accuracy estimation. The differences between methods of preliminary analysis of the input image and geometric primitives location are discussed in relation to the considered problems. For quality estimation an open dataset MIDV-500 is used, together with its extension for server-side problem version, created in scope of this work. Results show that using lines and quadrangles increase the location accuracy, and the proposed algorithm surpasses previously published works in classification precision and computational performance.
机译:在本文中,我们讨论了ID文档图像的同时文档类型识别和投影失真参数估计的问题。有两种考虑的情况。在第一种情况下,在设备上处理使用移动设备捕获的视频流。第二种情况考虑在服务器上处理过的照片或扫描的图像。对于每种情况,都定义了输入数据和处理速度的要求。提出了通用方法,该方法允许在两种情况下都解决该问题。该方法基于将图像表示为特征点和描述符的星座,但是为了执行更准确的失真参数,从输入图像中提取直线和四边形估计,并将其用作附加特征。描述了允许使用RANSAC将匹配的特征点,线和四边形进行组合以进行几何验证的技术。提出了最佳替代选择标准以及解决方案精度估算的方法。针对所考虑的问题,讨论了输入图像的初步分析方法与几何图元位置之间的差异。为了进行质量评估,使用了开放数据集MIDV-500及其在服务器端问题版本中的扩展名(在此工作范围内创建)。结果表明,使用直线和四边形可以提高定位精度,并且该算法在分类精度和计算性能方面都超过了先前发表的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号